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Colonial  control 

information  available 

for  Inuit  family 

research 

Disko  Bay  church  and  census 
records 


The  following  three  papers  were  presented  at 
the  IASSIST  '87  conference  in  a  plenary 
session  entitled  Research  trends  .  The  object  of 
the  session  was  to  focus  on  the  effect  of  new 
trends  in  research  and  data  collection  and  the 
advance  of  technology  on  the  use  of  data  and 
data  management  techniques. 


by  Per  Nielsen^ 
Danish  Data  Archives 


'Presented  at  the  International  Association  for 
Social  Science  Information  Service  and 
Technology  (IASSIST)  Conference  held  in 
Vancouver,  British  Columbia,  Canada,  Mav 
19-22,  1987 

^I  am  indebted  to  Kirsten  Elisabeth  Caning  for 
the  articles  she  sent  me,  and  to  Jens  Ludvig 
Wagner  at  the  DDA  for  assistance  and  support 
when  I  tried  to  get  an  overview  of  the  project 


1.    Historical  Sources  from  Greenland 

Indigenous  people  imder  colonial  rule  were  not 
left  in  peace!    This  was  not  because  of  any 
sincere  interest  in  the  ethnic  peculiarities  of  the 
people,  but  rather  because  bureaucratic 
registration  also  served  the  purpose  of  ensuring 
that  the  path  of  development  was  in  accordance 
with  the  stipiilations  of  the  colonizing 
government 

In  the  case  of  the  Eskimo  population  in 
Greenland,  the  Government  was  in  Copenhagen, 
where  difTerent  authorities  requested  very 
detailed  information  on  personnel  resources, 
production,  buildings,  etc.    In  most  cases,  the 
information  was  required  in  order  to  evaluate 
economic  development;  however,  even  health 
conditions,  births  and  deaths,  including  reasons 
for  death,  were  reported.    And  in  realm  of  the 
church,  baptisms,  confirmations,  weddings,  and 
burials  were  reported  to  Copenhagen. 

Prior  to  1774,  the  "colonial  initiative"  was 
mainly  a  private  one,  consisting  of  tradesmen  on 
one  hand,  missionaries  on  the  other.    In  1774, 
the  main  administrative  tasks  were  centralized 
within  Royal  Greenland  Trade,  leaving  the 
chiu-ch  to  take  care  of  its  own  affairs.    From 
1782  to  1950,  Greenland  was  administratively 
divided  into  two  Inspectorate  Regions  ,North 
and  South,  each  with  its  own  Inspector;  the 
division  line  was  located  between  Egedesminde 
and  Holsteinborg.    Needless  to  say,  there  were 
modifications  introduced  to  this  system  during 
the  last  decades  of  the  19th  century;  and  many 
administrative  changes  took  place  during  the 
first  half  of  the  20th  century.    However,  a 
detailed  description  of  these  is  beyond  the  scope 
of  this  short  introduction.' 


■'The  above  outline  is  based  on  Kirsten 
Elisabeth  Caning,  "Personalhistoriske  kilder  i 
gr0nlandske  arkiver."    Personalhistorisk  tidsskrift 
1979. 
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During  the  period  preceding  1951.  all  church 
matters  as  well  as  most  issues  concerning 
education  were  governed  by  the  same 
administrative  unit  in  Copenhagen.    It  is  from 
this  clerical  administration  that  most  of  the 
information  dealt  with  in  this  article  originates. 


2.    The  Church  Records 

ChtiTch  records  were  maintained  in  the 
Greenland  parishes  during  the  first  half  of  the 
18th  century;  most  of  them  have  disappeared, 
and  a  couple  of  preserved  manuscripts  are 
incomplete  in  that  they  include  information  on 
the  top  strata  of  the  Greenland  poptilation  only. 
From  the  second  half  of  the  18th  century,  there 
are  preserved  church  records  from  Upemavik, 
Godhavn  and  Egedesminde.    The  lists  of  those 
baptized  (Christening  Lists)  represent  the 
majority  of  entries  in  the  early  years.    The 
salvation  of  the  souls  of  the  heathens  was  one 
of  the  major  preoccupations  of  missionaries; 
consequently,  the  number  of  baptisms  performed 
was  an  indication  of  the  efficiency  of  the 
missionaries. 

The  church  records  from  the  beginning  of  the 
19th  centur>-,  held  in  the  Danish  archives,  are 
more  complete,  especially  those  from  many  of 
the  parishes  in  the  North  Greenland 
Inspectorate.    The  records  from  the  South 
Greenland  Inspectorate  are  not  so  complete,  in 
part  due  to  the  loss  of  a  lot  of  these  documents 
during  their  transportation  to  Denmark  aboard 
the  ship  "Hans  Hedtoft"  (Januar>-  1959)." 

The  church  records  which  have  been  preserved 
consist  of  four  separate  types  of  lists 
representing  four  different  types  of  events: 


Christening  Lists,  Confirmation  Lists,  Marriage 
Lists,  and  Burial  Lists.    Concommitandy,  the 
data  sets  stored  at  the  DDA  (the  "raw  data") 
are  separated,  in  their  initial  form,  into  4 
different  files: 

■  DDA-0311:  Population  History  of  Greenland 
1800-1930:  Christening  Lists 

■  DDA-0312:  Population  History  of  Greenland 
1800-1930:  Confirmation  Lists 

■  DDA-0313:  Population  History  of  Greenland 
1800-1930:  Marriage  Lists 

■  DDA-0314:  Population  History  of  Greenland 
1800-1930:  Burial  Usts 


"In  the  "Danish  Titanic"  case,  "Hans  Hedtoft" 
collided  with  an  iceberg  on  her  maiden  vovage 
to  Greenland,  January  30th,  1959.  All  95  crew 
members  and  passengers  died. 


3.    The  Census  Records 

The  missionaries  were  also  responsible  for  the 
registration  of  all  people  in  their  districts 
(including  those  who  were  not  Christians,  and 
who,  consequently,  never  appeared  in  the 
Church  Records).    The  censuses  in  Greenland 
contained  the  same  information  as  those  in 
Denmark:  name,  age,  occupation,  and  posiuon 
in  household.    In  addition  to  these,  the 
Greenland  censuses  contained  information  on 
race  (Eskimo,  mixed  and  European)  as  well  as 
an  indication  of  whether  or  not  the  person  had 
been  baptized. 

At  the  National  Archive  in  Copenhagen,  there 
are  census  records  from:  1834,  1840,  1845,  1850, 
1860,  1870.  1901,  and  1911.    Those  from  the 
Umanak  and  Disko  Bay  Districts,  for  all 
censuses  except  the  1911  census  (because  of  the 
80-year  access  restriction  to  personal  records  at 
the  National  Archives),  have  been  made 
computer-readable  and  are  available  as: 
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DDA-0645;  Population  History-  of  Greenland 
1800-1930:  Census  Lists 


4.    The  Data  Collection  Process 


3.    Geographic  Coverage  and  Time  Period 
Included  in  the  Datasets 

In  the  initial  definitive  stages  of  the 
"Demographic  Histoi}-  of  Greenland,  1800-1930" 
research  project,  both  geography  and  time 
periods  had  to  be  taken  into  account    The 
above  outlined  existence  of  nearly  complete 
archival  series  of  church  records  and  census 
records  in  Archives  situated  in  or  near 
Copenhagen,  has  some  bearing  on  the 
geographic  regions  and  time  periods  to  be 
covered. 

Geographically,  the  North  Greenland 
Inspectorate  had  the  most  complete  records. 
Consequently,  the  rather  isolated  regions  in 
northwest  Greenland,  the  Disko  Bay  District  and 
the  Umanak  District  (isolated  in  terms  of  in- 
and  out-migration  from/to  the  sunotmding 
regions,  so  that  few  individuals  "disappear"  via 
migration)  were  selected;  from  north  to  south 
(or  rather,  "around  the  bay")  in  the  Umanak 
and  Disko  Bay  Districts  thus  defined,  Umanak, 
Godhavn,  Ritenbenk,  Jakobshavn,  Christianshab, 
and  Egedesminde  are  the  larger  township  areas 
included  in  the  project 

With  respect  to  time  period  covered,  the 
availability  of  records  suggests  that  the 
registration  should  begin  around  the  mm  of  the 
18th  century,  i.e.  aroimd  the  year  1800.    For 
reasons  of  restricted  access  (and,  of  course,  as  a 
means  of  reducing  the  resources  required  for 
the  data  collection  process),  the  registration  was 
stopped  at  the  beginning  of  the  20th  century. 


Coding  was  done  from  the  original  sources,  or 
copies  thereof  by  the  principal  investigator, 
Kirsten  Caning,  together  with  a  smdent  aide 
(see  figure  1).^  The  coding  sheets  were  then 
sent  out  to  have  the  information  transformed  to 
a  computer-readable  medium.    The 
machine-readable  data  were  deposited  with  the 
DDA,  where  Jens  Ludvig  Wagner  (who  was 
experienced  in  demographic  data  and  family 
reconstitution  methods  from  his  work  with 
several  other  historical  projects)  took  over  the 
transformation  tasks  as  they  were  requested  by 
Ms.    Caning.'  During  this  process,  the  following 
rectangular  files  were  generated: 

■  DDA-0311: 17,248  Baptism  entries,  each  with 
up  to  53  variables 

■  DDA-0312:  10,037  Confirmation  entries,  each 
with  up  to  62  variables 

■  DDA-0313:    4,720  Wedding  entries,  each 
with  up  to  90  variables 

■  DDA-0314:  13,217  Burial  entries,  each  with 
up  to  84  variables 

■  DDA-0645:  23,953  Census  entries,  each  with 
up  to  37  variables 


These  five  files  form  the  basic  "raw  data"  of 
the  project    However,  after  this  initial  collection 
of  raw  data,  a  tremendous  effort  was  devoted  to 
data  correction  and  family  reconstitution. 
Correction  involved,  for  example,  the  deletion  of 
double  entries  (e.g.  the  baptism  of  the  same 
child  in  two  parishes  in  the  Church  Records,  or 


^[figures  and  tables  are  collected  together  at  the 

end  of  the  article.    Ed.  note] 

'See  Jens  Ludvig  Warner.  "Datamaterialer  med 

komplekse  strukturer",  in  DD.^-Nvt  25:64-70, 

1983. 
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enumeration  of  the  same  person  in  two 
households  in  the  Census  Records).    Those  with 
experience  in  family  reconstitution  projects  know 
the  amount  of  preparatory  work  to  be  done 
before  one  can  produce  a  centralized  file. 
Therefore,  I  shall  go  into  some  detail 
concerning  the  methodology  applied  in  this 
particular  family  reconstitution  project.    How  did 
we  derive,  from  the  above  listed  five  raw  data 
files  within  which  all  the  events  were 
sequentially  numbered,  the  so-called  "centralized 
file"  containg  the  best  possible  reconstitution  of 
famihes? 


5.    Family  Reconstitution:  Manual  and/or 
Automatic  Procedures 

Within  the  discipline  of  historical  demography, 
there  has  been  quite  a  long  tradition  of  two 
"schools",  one  advocating  automatic  (i.e. 
computer  based)  family  reconstitution 
procedures,  the  other  maintaining  that  a  lot  of 
human  thinking  is  necessary  in  order  to  get  as 
close  to  the  ideal  of  a  complete  reconstitution  as 
possible.    In  the  case  of  the  data  from 
Greenland,  a  mixed  automatic  and  manual 
process  was  used. 

With  the  raw  data  (or  basic  files)  as  the  point 
of  depanure,  two  tasks  had  to  be  completed: 
that  of  identifying  the  events  (e.g.  a  baptism, 
confirmation,  wedding,  burial,  or  membership  in 
a  certain  household  in  a  specified  census 
enumeration)  as  attributes,  or  descriptors, 
characterizing  the  "central  persons"  (or  "actors") 
when  the  event  took  place,  and  that  of 
establishing  "pointers"  among  all  the  central 
persons  based  on  their  family  relationships. 

Who  are  the  Central  Persons  (CPs)  in  this 
project"  -  In  the  event  of  a  baptism  or  a 
confimiaiion,  the  baptized/confirmed  person  and 
his/he.'-  parents  are  the  CPs;  in  the  event  of  a 


wedding,  bride  and  groom  as  well  as  parents  of 
both  of  the  married  persons  are  CPs;  when  the 
event  is  a  burial,  the  buried  person  and  his/her 
parents  as  well  as  a  spouse  and  children  are 
CPs.    Finally,  in  census  enumerations,  each 
person  listed  in  each  census  is  a  CP.    By 
machine  generation,  the  Basic  Files  were  thus 
expanded  into  a  "Theoretical  Centralized  File" 
with  more  than  200,000  "central  person  in  one 
event"-combinations.    We  shall  call  these 
Central  Person-Event  records  (CPE).    A 
sequential  nimiber  was  allocated  to  each  CPE. 

The  CPE-file  was  sorted  on  names;  some 
auxilliary  lists  sorted  on  districts  and  other 
criteria  were  produced  by  Jens  Wagner 
whenever  Kirsten  Caning  needed  and  requested 
such  lists  during  the  work.    The  sorted 
CPE-file(s),  written  out  as  long  paper  listings, 
formed  the  basis  for  the  next  major  state:  the 
process  of  manually  inserting  Personal 
Identification  Nimibers  (PIN),  (see  fiqiue  2). 

Even  though  the  sorting  based  on  names  did 
help  a  lot,  the  task  of  numbering  more  than 
20,000  individual  persons  (as  it  turned  out  later) 
on  more  than  200,000  CPE-records  was  a  heavy 
consumer  of  both  time  and  human  memory! 
And  now  we  may  return  to  the  question 
whether  the  machine  or  the  manual 
reconstitution  is  "better",    ll  goes  without  saying 
that  the  choice  of  method  depends  heavily  on 
the  nature  and  quality  of  the  raw  data;  so  we 
shall  talk  about  advantages  and  disadvantages, 
(see  fiqure  3). 

.Automatic  family  reconstitution  has  the 
advantage  of  being  relatively  quick  and  being 
well  documented.    However,  in  many  cases  a  lot 
of  persons  are  left  "split"  in  two  or  more 
persons.    For  example,  figures  1  and  2 
(1901-census)  compared  to  the  extensive  figure 
3  (an  excerpt  from  the  CPE-file  after  all  CPs 
have  been  numbered)  clearly  demonstrates  a 
number  of  problems  with  machine 
reconstitution:  Nikolaj  Jens  Andreas  Lange  was 
found  under  many  different  names,  partly  due 
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to  the  fact  that  the  first  names  may  be  written 
in  any  order  (some  of  them  may  be  missing), 
partly  because  of  different  spellings  (some  of 
which  are  not  caught  in  the  normalization 
process,  e.g.    Nik(olaj)  vs.    Nic(olaj)).    Figures  2 
and  3  show  that  Kirsten  Caning  preferred  to 
work  with  "normalized  names"  in  order  to 
minimize  the  spelling  problem.    But  even  with 
that  precaution,  the  decision  was  made  to 
perform  the  reconstitution  manually. 

The  reconstitution  process  was  carried  out  by 
entering  PIN-numbers  on  all  relevant  locations 
in  the  CPE-file.    During  that  process,  a  number 
of  errors  were  detected  and  corrected  -  this  is 
probably  one  of  the  main  advantages  of  manual 
reconstitution.    After  some  iterations  of 
PIN-ntimbering,  the  theoretical  CPE-file  was 
reduced  from  more  than  200,000  records  to 
approximately  131,000  CPE-records. 


6.    The  Documentation  Problem  in  Manual 
Family  Reconstitution 

Who  "disappeared"  from  the  theoretical 
CPE-file?    Approximately  70,000  entries 
vanished  from  the  "theoretical"  to  the 
"reduced"  version  of  the  CPE-file.    The  records 
of  type  "own  baptism"  were  reduced  from 
17248  to  16312;  "own  confirmations"  were 
reduced  from  10037  to  9169;  "own  marriage" 
from  9442  to  8388;  and  "own  btmal"  from 
13217  to  11678.    Because  some  census 
registrations  had  been  enoneously  omitted  from 
the  first  data  entr)-  process,  the  number  of 
census  registrations  increased  from  the  original 
23953  to  24076.    (The  first  number  is  the  size 
of  file  given  in  section  4  for  Raw  Data  Files). 

Theoretically,  one  might  expect  that  even- 
person  involved  in  a  baptism,  confirmation, 
marriage,  or  burial  would  have  two  parents. 
However,  these  CPs  have,  in  many  instances. 


not  been  identified  in  the  source  records. 
Therefore  the  number  of  CPE-records  in  which 
parents  were  registered  in  each  of  the  four 
events  was  reduced  from  the  "theoretical" 
numbers  34496,  20074,  18884  and  26434,  to 
31338,  13538,  4902  and  9181  respectively. 
Predictably,  it  was  in  the  events  in  which  the 
"main  actors"  were  adults  (marriages  and 
btirials)  that  the  information  on  parents  was 
most  commonly  missing. 

There  is  a  major  difference  between  the 
theoretical  expectation  that  there  was  a  spouse 
for  each  person  buried  (i.e.  13217  spouses)  to 
the  actual  2328  cases  in  which  a  spouse  was 
actually  identified.    This  is  due,  to  a  great 
extent,  to  the  fact  that  many  burials  were  of 
children  (high  infant  mortality  rate),  and  also  in 
part  to  the  death  of  immarried  adults;  also 
some  of  the  missing  spouses  were  due  to  lack 
of  registration.    Finally,  theoretically,  it  was 
expected  that  there  would  be  information  on  at 
least  one  child  for  each  buried  person  (i.e.  an 
expectancy  of  13217  persons);  however,  it 
turned  out  that  this  information  was  available  in 
the  source  records  only  in  56  cases. 

Does  this  mean,  now,  that  we  have  lost  the 
information  in  those  of  the  CPE-records  that 
were  not  merely  theoretical,  but  in  which  the 
information  was  not  given  in  the  sotirce  records? 
It  does  not! 

If  a  person  was  buried  who  was  married  at  the 
time  s/he  died  without  an  indication  thereof  in 
the  burial  record,  we  have  data  about  the 
marriage  from  a  difTerent  source.    Similarly,  if 
one  or  both  parents  of  a  baptized  person  were 
not  mentioned  in  the  Baptism  File,  we  still  may 
know  of  them  from  other  sotirces,  e.g.  from  a 
relevant  census.    This  leads  us  on  to  the  "record 
linkage",  which  in  this  project  was  done  in  an 
ad  hoc  data  system. 
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7.    Finalization  of  the  Family  Reconstruction  in 
a  Data  Base  Environment 

During  1984  and  1985,  several  MLD/ASTRID' 
data  bases  were  designed  and  tested  on  subsets 
of  the  data  from  the  Greenland  ProjecL    Even 
though  a  number  of  these  designs  were 
acceptable  from  the  substantive  point  of  view, 
they  had  to  be  rejected  because  they  would 
consume  too  much  computer  power  when 
loaded  with  the  full  data  base.  ' 

With  further  computations  and  tests,  a  data  base 
design  was  finalized  by  the  end  of  1984'  and 
implemented  with  all  the  data  from  the  project 
The  basic  design  of  the  data  base  is  that  one 
part  contains  all  PERSONS  and  another  pan 
contains  all  the  EVENTs.    Each  person  is 
identified  by  the  PIN-number  preceded  by  M 
(males)  or  K  (females).    There  is  no  need  for  a 
lot  of  pointers  because  the  searching  is  based 
on  key  fields  in  PERSONS  and  EVENTs 
respectively. 

I  shall  not  go  into  the  technical  details  of  the 
data  base  in  this  paper.    However,  1  shall 
demonstrate  the  output  from  the  data  base  using 
the  two  heads  of  household  that  we  have 
followed  in  figures  1-3  above,  (refer  to  fiqure 
4). 

As  can  be  seen  from  figure  4,  not  much  has 
been  done  to  present  the  output  in  an  easily 
understandable  way;  thus  far,  only  people 


'MLD/ASTRID  is  a  data  base  language  and 
system,  defined  by  J0rgen  Grosb0l  at  DDA, 
described  in  his  manual:  MLD  Data  Bases  and 
the  ASTRID  Language.    CHenseTDDXT^Sr" 
'The  basic  design  was  described  in  ICarsten 
Boye  Rasmussen,  "MLD/ASTRJD  database  for 
Gr0n]ands  befolkningshistorie".  Working  Paper 
A460-KB.    DDA  1984-07-31. 
'Jens  Wagner,  "MLD/ ASTRID  database  for 
Grpnlands  befolkningshihistorie  -  design  uden 
referencer",  Working  Paper  A460-JW.  DDA,  10. 
December  1984. 


actively  involved  in  the  project  have  used  the 
system,  and  they  know  what  the  encryptions 
mean.    If  the  data  base  system  were  to  be  used 
by  others  outside  the  project,  more  text  would 
have  to  clarify  the  output 

Using  this  data  base,  the  principal  investigator  is 
presently  finalizing  the  family  reconstitution 
task.    Also,  special  purpose  functions  have  been 
designed  for  demonstration  purposes;  for 
example,  it  is  quite  easy  to  establish  pointers  to 
allows  a  user  to  move  up  and  down  along 
genealogical  lines,  drawing  the  complete 
pedigree  of  the  person  under  analysis. 

It  should  be  noted  here  that  several  features 
that  are  specific  to  the  Inuit  culture  are 
refiected  in  the  data.    People  may  be  referred 
to  their  biological  family,  to  an  extended  family 
or  household,  or  to  the  particular  house  in 
which  they  lived.    It  was  quite  usual  for  the 
Eskimo  to  live  in  so  called  longhouses.Several 
households  lived  in  the  same  longhouse  during 
the  winter  (when  censuses  were  taken);  during 
the  summer,  they  might  move  away  from  the 
house,  living  in  tents  or  similar  dwellings.    The 
following  year,  the  family  might  move  to  a 
different  longhouse  at  the  same  or  at  a  different 
location;  the  houses  were  not  owned  by  anyone 
-  or,  rather,  they  were  owned  by  whoever 
happened  to  be  occupying  them.    This  tradition 
of  living  together  in  longhouses  included  a 
social  security  aspect;  the  data  show  that  this 
tradition  was  slowly  abandoned  in  favour  of 
nuclear  family  houses  diuing  the  19th 
century.'" 

With  the  huge  mmiber  of  cross-identifications 
in  the  data,  it  is  possible  to  follow  individuals 
or  families  (biological  or  extended)  for 
generations.    Alternately,  it  is  possible  to 
examine  a  single  location,  and  to  describe  how 
life  changed  in  that  particular  location  over  the 


'"Described  in  Kirsten  Caning,  "Era 
bofasllesskaber  til  kasmefamihe",  Beretning  fra 
Carlsberfondet  K0benhavn  1986. 
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years.    The  latter  strategy  was  applied  by  the 
principal  investigator  in  an  article  on 
Sermermiui  -  a  small  place  with  only  two 
houses  with  20  and  12  inhabitants  respectively 
(3  households  per  house).  ^' 


8.    Access  to  the  Datasets  for  Secondarj 
Analysis 

Despite  the  fact  that  the  primar>-  investigator 
has  not  yet  finalized  the  troublesome 
reconstituition  process,  it  will  be  possible  for 
secondar>-  analysts  to  have  access  to  the  data, 
with  the  consent  of  Kirsten  Caning.    All 
requests  should  be  directed  to  the  DDA. 

Needless  to  say,  it  is  not  quite  as  simple  to 
address  a  user  requesting  this  type  of  data  as  it 
is  to  send  out  a  survey  file.    The  user  must 
define  a  priori  the  subsets  or  the  formats  s/he 
can  handle.    Further,  it  is  the  nature  of 
historical-demographic  data  of  the  txpe  we  have 
been  discussing  that  their  quality'  improves  over 
the  years;  during  analysis,  new  findings 
concerning  data  relationships  can  be  added. 
Such  changes  are  now  reflected  in  the  data  base 
version  of  the  data,  but  not  in  the  Raw  Data 
Files  mentioned  in  section  4. 

Consequently,  disseminable  versions  of  the  data 
should  be  extracted  from  the  data  base 
according  to  the  specifications  of  the  individual 
user.    With  the  data  stored  in  a  data  base 
management  system,  it  is  possible  to  generate 
rectangular  files  that  can  be  analyzed  with 
standard  statistical  software.    However,  to  get 
the  full  personal  histor.  description  capabilities, 
a  data  base  management  system  environment 
would  be  needed  by  the  user.n 


"Tinna  Mobjerg  and  Kirsten  Caning, 
"Sermermiui  in  the  Middle  of  the  Nineteenth 
Centurv",  Arctic  Anthropologv  vol. 
23{l-2):nTm,  1986. 
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Figure  1:  Xerox  copy  of  one  of  the  source  materials,  viz.  the  Census 
List  from  October  1901,  with  information  from  Saraqaq  in  Ritenbek.  The 
upper  household  is  that  of  Nikolaj  Jens  Andreas  Lange,  living  with  his 
wife  and  4  unmarried  children  aged  between  21  and  30.  The  lower  house- 
hold is  an  older  son,  Lars  Jonas  Lange,  with  his  wife  and  4  children. 


hl!"^iu''^"^'^  u-  ^<^P;"„du«.d  from  an  illu'^tialjon  in  Kirsicn  BiMhcth  Canine    "Om  den  eraniand^ke 
belolknmgs  hisione".  printed  in  l-orsknine  i  Gipniand  1/82.  p.  .S.  ^'  gmniandske 
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Figure  2:  The  same  persons  as  those  listed  in  figure  1,  now  listed  from 
the  computer.  To  the  left  of  the  "normalized"  names  field,  lots  of  num- 
bers of  identification  appear,  i.a.  the  PIN-number  appearing  in  columns 
7-12.  Our  head  of  household  Nikolaj  Jens  Andreas  Lange  has  PIN-code  no. 
12914  -  in  the  Ritenbenk  District  (code  4  in  col.  19),  the  1901-census 
(code  28  in  cols.  16-17)  from  which  the  information  was  taken.  He  has 
that  number  in  all  other  datasets  as  well,  which  we  shall  see  later. 
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figure  4 


•PERSON:     H01291X 
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1(009951894081520500    <  026921  889072  51  0  500    1(0310*1880072910502 
K03187187A05302n502    1(0*5011842032801000    K051  391  866031  *20500 
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D16835187803O310O02  0  0658018  7*0104  10502    » 1 §' !« J g ^§043020502 

D173721 86on317i0502  KOOS751 892031 31 0662    K009801 89203 1 320662 
1(009951894081520500 
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miiwmmmm 

S700491 83*000003001 

1705081854000003001 
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T708581 87000000150? 
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BARN  1(040732    H01291* 

BARN  BH012903    M012886    H012916    K062386    K061175 
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6020551898111320012    0  0207119  00111120012    0101801867040201000 
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V033851930101010502 


Figure  4:  The  information  in  the  PERSON  archive  on  the  same  two  heads 
of  household  as  reflected  in  earlier  figures.  The  person  is  identified 
by  the  PIN-code  preceded  by  M  (males)  or  K  (females);  the  second  line 
gives  the  parents,  third  (and  forth  if  necessary)  the  children.  Under 
the  heading  GIFT  (=  married)  the  spouses  are  identified.  Finally,  under 
the  heading  TRANSDATA  there  are  references  to  all  events  in  which  the 
individual  was  considered  a  CP.  The  initial  letters  identifying  events 
are:  B  for  burials,  D  for  baptisms,  K  for  confirmations,  S  for  censuses 
with  the  extended  family  as  the  unit,  T  for  censuses  with  a  biological 
family  as  the  unit,  and  V  for  the  weddings. 
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Canadian  social 
trends: 

a  new  Statistics  Canada 
social  statistics  publication 


by  D.  Craig  McKie,  Ph.D.' 

Chief,  Social  Reporting  &  Editor,  Canadian 

Social  Trends 

Housing,  Family  and  Social  Statistics  Division 

Statistics  Canada 


In  the  last  ten  years  at  Statistics  Canada,  the 
governmental  collector  and  keeper  of  the 
nation's  storehouse  of  numerical  information 
about  itself,  momentous  changes  have  occurred 
in  the  ways  data  are  collected,  processed, 
analyzed  and  published.    It  is  not  recogniiably 
the  same  organization  I  first  joined  in  1977. 
Since  that  time,  the  widespread  importation  of 
new  methods  and  new  technology  have  in  ttim 
been  refiecled  in  organizational  changes. 
Accompanying  these  changes  have  been 
pressures  to  reduce  costs  and  overall  staff  levels 
and  to  recover  a  larger  part  of  the  costs  of 
operadon  from  clients. 

While  the  technical  aspects  of  this  change  are 
perhaps  better  known  (and  we  could  refer  to 
random  digit  dialing-based  interviewing 
techniques,  a  microcomputer  on  every  other 
desk,  and  very  sophisticated  analytic  software 
and  graphics  hardware  to  match  it),  changes  on 
the  publishing  side  have  also  been  considerable. 
But  while  the  necessity  for  modernizing  the 
dissemination  of  data  has  been  well  recognized 
(by,  for  instance  the  widespread  acceptance  and 
use  of  electronic  bulletin  board  type  data 
services  and  the  Bureau's  own  service  CANSIM, 
and  by  sales  of  data  on  floppy  disks,  user  tapes, 
and  other  similar  services),  publishing  has  not 
received  quite  the  same  amoimt  of  attention. 
The  Bureau,  for  instance,  still  publishes  many 
strange  and  exotic  publications  such  as:  the 
Cereals  and  Oilseeds  Review,  Gas  Utilities,  The 
Dairy  Review,  and  last  but  not  least.  Sawmills 
East  of  the  Rockies. 


'Presented  at  the  International  Association  for 
Social  Science  Information  Service  and 
Technology  (IASSIST)  Conference  held  in 
Vancouver,  Briush  Columbia,  Canada  on  May 
19-22,  1987 


However,  the  number  of  users  of  whatever 
origin  who  are  ready  to  plow  through  table 
upon  table  to  find  a  single  figure  -  or  to 
construct  their  own  annual  dme  series  from  25 
years  of  monthly  pamphlets  is  limited.    Further, 
many  users  have  neither  the  time,  the  training, 
nor  the  inclination  to  do  such  work. 

Those  who  decline  to  engage  in  such  activities 
are  legion.    Some  telephone  me;  undoubtedly 
some  may  telephone  some  of  you.    Indeed,  at 
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the  recent  meetings  of  the  Population 
Association  of  America  in  Chicago,  one  eminent 
demographer,  who  shall  remain  tmnamed,  stated 
to  one  of  my  colleagues  that  it  was  easier  to 
obtain  Zambian  data  than  to  get  it  from 
Statistics  Canada. 

Such  reactions  are  not  tmcommon.    Foreign 
nationals,  or  the  inexperienced,  simply  do  not 
know  where  to  begin.    Other  domestic  users, 
although  they  know  how  to  begin,  lack  the  iron 
will  necessary  to  persist  in  their  quesL    They 
tend  to  fall  into  several  broad  categories  of 
persons.    Three  prominent  types  are: 

1.  those  who  are  interested  in  a  subject  but 
lack  the  knowledge  of  how  to  go  about 
finding  suitable  data  and  who  are  intimidated 
by  the  traditional  publication  of  official 
statistics  [massive  cryptic  tables  with  lots  of 
footnotes,  runic  symbols,  and  qualifications 
which  seem  meant  to  undermine  any  simple 
interpretation  of  the  contents] ; 

2.  those  who  know  what  they  want  and  where 
to  find  it  but  cannot  afford  the  time  to 
assemble  the  data  nor  interpret  it  [there  are 
few  who  do  not  shudder  at  the  task  of 
sorting  through  52  pages  of  tables,  the 
numbering  of  which  may  change  from  year 
to  year,  over  enough  years  to  assemble  a 
graph  which  can  indicate  the  direction  of 
change,  perhaps  standardized  to  some  rate 
per  population] ; 

3.  those  who  don't  want  data  at  all  but  rather  a 
prose  summary  of  a  data  set  to  use  as 
evidence. 


necessary.    Our  subsequent  activities  in  search 
of  such  a  format  have  been  based  on  the 
premise  that  there  is  a  significant  demand  from 
a  wide  range  of  governmental  and 
non-governmental  agencies  for  comprehensive 
and  systematic  information  which  identifies, 
describes,  and  analyzes  social  trends  and 
conditions  in  Canada.    This  demand  is  expressed 
in  a  number  of  ways  including  requests  to 
Statistics  Canada  for  assistance,  information,  and 
ideas.    These  requests  come  from  government 
agencies,  members  of  parhament  community 
groups,  and  the  press.    Canadian  Social  Trends 
is  one  response  to  these  requests. 

There  is  much  potential  for  information  which 
illuminates  the  social  conditions  which  are  the 
objects  of  policies,  programmes,  and  public 
concern.    After  some  trial  and  enor,  which 
involved  the  preparation  of  a  similar  aimual 
format  (it  was  discarded  because  of  the 
inevitable  loss  of  timeliness  in  an  annual 
publishing  cycle),  the  final  decision  was  to  go 
with  a  quarterly  format  which  at  least 
superficially  looks  like  a  news  magazine  -  a 
format  familiar  to  almost  everyone  in  western 
countries  and  one  users  expectations  of  which 
are  in  a  sense  built-in.    One  expects  succinct, 
highly  graphical  treatment  of  issues  pertinent  to 
the  reader  (they  are  not  arcane)  interspersed 
with  advertising  (information). 

The  start  of  publication  of  Canadian  Social 
Trends  is  one  notable  aspect  of  the  latter  type 
of  change. 


I  think  it  was  our  belief  when  we  started  to 
think  about  the  form  of  what  turned  out  to  be 
CST,  in  1984,  that  most  Canadians,  to  the 
extent  that  they  seek  to  use  Statistics  Canada 
material,  fall  into  one  of  these  three  categories 
or  subsets  of  them.    To' reach  such  people  with 
useful  information,  a  new  format  wotild  be 


The  New  Publication 

In  terms  of  the  number  of  its  publications  (in 
excess  of  600)  Statistics  Canada  could  well  be 
the  largest  publisher,  public  or  private,  in 
Canada.    However,  its  publications  have  by 
tradition  been  dry  and  heavily  given  to  technical 
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detail.    In  contrast,  Canadian  Social  Trends  is  an 
entirely  new  type  of  publishing  endeavour  for 
Statistics  Canada. 

The  purpose  of  CST  is  to  report  on  changing 
social  conditions  in  Canada  in  order  to  provide 
government  and  corporate  policy-makers  and 
planners,  marketers,  and  others  with  a  basis  for 
decision-making.    It  provides  recent  and 
historical  evidence  for  the  direction  and 
magnitude  of  change  of  important  social  trends. 
Its  intent  is  to  describe  and  make  evident  a 
trend's  inter-relation  with  policy  concerns,  and 
promote  an  understanding  of  social  conditions 
(and  economic  developments  where  a  direct 
social  outcome  is  established). 

The  publication  has  five  distinct  objectives: 

1.  to  provide  decision-makers  in  government, 
industry,  and  the  social  service  sector  with  an 
awareness  of  social  and  socio-economic 
trends  and  information  relevant  to  their 
environment  in  an  appealing  format 
Improving  accessibility  to  Statistics  Canada 
information  and  information  products  is  the 
prime  objective; 

2.  to  provide  users  with  a  single,  authoritative 
source  for  social  trend  description  and 
analysis  in  an  easily  accessible  and  readable 
form.    The  intention  is  to  integrate  sets  of 
data  and  make  the  connections  between 
isolated  findings; 

3.  to  provice  committed  information  users  with 
information  about  Statistics  Canada's  data 
holdings,  information  products,  and  services; 

4.  to  provide  Statistics  Canada  with  a  flagship 
with  which  to  raise  public  awareness  of  its 
corporate  character,  value,  and  contributions 
to  the  public  interest    The  publication 
supports  the  data  collection  process  by 
showing  a  public  return  of  information  to 
respondents  -  this  return  helps  justify 
impositions  on  respondents'  time; 


to  provide  a  vehicle  with  which  to  publicize 
the  existence  of  Statistics  Canada  information 
and  data  products  and  statistical  services  to 
get  the  greatest  possible  use  from  a 
considerable  investment  of  public  ftmds  in 
data  collection  while  at  the  same  time 
earning  revenues  to  help  defray  the  costs. 


Audiences  foreseen  or  already  realized  for  this 
publication  are:  policy-makers  and  planners, 
both  governmental  and  corporate,  social  service 
agencies,  educators  engaged  in  teaching  social  or 
Canadian  studies  at  the  secondary'  or 
post-secondary  level,  libraries,  the  media, 
international  information  repositories,  and 
market  research  concerns. 

Subject  matter  is  selected  according  to  a  loose 
formula.    In  each  issue  there  should  ideally  be 
found  one  article  on  each  of  the  following 
general  topics:  demography,  labour  force,  and- 
income.    In  addition,  a  rough  rotation  of  issue 
selection  exists  with  respect  to  institutional  areas 
(e.g.  culture,  health,  education,  and  justice);  and 
with  respect  to  target  groups  (i.e.  women,  the 
elderiy,  youth,  native  peoples,  and  the  disabled). 


New  Presentation  Styles 

It  is  difficult  in  words  to  describe  style. 
Therefore  I  will  show  you  some  before  and 
after  shots  of  the  same  data  in  the  old  and  the 
new  format    (see  examples  collected  together  at 
the  end  of  this  article.    Ed.  note). 

Now  entering  its  second  year  of  publication, 
CST  is  intended  to  be  the  authontative  voice 
for  Canadian  social  statistics  as  they  are 
collected  and  disseminated  by  Statistics  Canada. 
The  quarteriy  publication  attempts  to  place 
social  statistical  reporting  on  a  much  more 
accessible  basis  than  was  previously  the  case.    It 
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carries  analvtical  articles  of  var\-ing  lengths  on 
important  topics  of  social  statistical  reporting 
(such  as  the  decline  in  fertility'  and  the  increase 
in  lone  parent  families).    Each  edition  of  the 
report  is  approximately  44  magazine  format 
pages  long  and  feattires  text  written  for  the 
educated  layman.    With  25  -  30  graphics  per 
issue  (photographs  and  illustrative  drawings),  it 
is  primarily  intended  to  be  used  as  the  best 
source  of  Canadian  social  trends  material  for 
government  policy  makers,  plarmers,  and  private 
sector  marketing  specialists  in  addition  to 
students.    There  are  no  private  sector 
publications  which  compete  with  CST.    As  a 
result,  it  has  qtiickly  become  the  obvious  source 
for  Canadian  social  statistical  data  and  carries 
new  analyses  and  data  reporting  in  addition  to 
publishing  simmiary  articles  on  previously 
released  data.    In  addition,  each  edition  carries  a 
social  indicators  page  and  notices  of  the  release 
of  new  Statistics  Canada  publications  which 
might  be  of  interest  to  readers. 

What  is  really  novel  about  this  publication  is 
that  it  is  a  clear  departure  from  the  usual 
'miilti-table  and  short  descriptive  text'  format 
traditionally  associated  with  governmental 
statistical  publications.    Early  acceptance  of  the 
format  has  been  much  greater  than  initially 
expected:  Canadian  Social  Trends  already  has 
the  largest  paid  subscription  of  any  Statistics 
Canada  periodical  publication  and  indications 
are  that  interest  outside  the  country  is  high, 
perhaps  reflecting  the  fact  that  for  a  foreign 
reader  tmfamiliar  with  Canadian  social  statistics, 
this  publication  is  in  most  senses 
self-explanatory,  carrying  with  the  articles  the 
definitions  and  data  qualifications  necessary  for 
reuse  without  further  reference  to  sources.    A 
three  hour  audio-visual  presentation  is  also 
available  to  clients  for  a  fee  when  resources  are 
available. 


diskette  in  Lotus  format,  thus  gready  shortening 
their  data  collection  and  processing  times  and 
reducing  the  cost  of  data  acqtiisition  accordingly. 

The  publication  is  put  together  by  a  full  time 
staff  of  six  people,  including  myself,  with  the 
assistance  of  a  professional  design  firm  and 
students  from  Canadian  university  co-operative 
programmes.    At  any  given  time,  we  have  from 
two  to  four  imiversity  students  on  work  term 
assignments.    They  receive  course  credit  for 
working  for  the  project    In  addition,  we  employ 
some  contract  writers  and  editors,  although  the 
btilk  of  the  editorial  content  is  generated  from 
within  Statistics  Canada. 

The  publication  is  issued  in  both  official 
languages.    Page  design  is  by  a  contract  design 
firm. 


Examples 

I  will  use  the  remaining  time  available  to  me  to 
show  you  some  concrete  examples  of  what  we 
mean  by  social  trends,    (see  examples  cdlecied 
together  at  the  end  of  this  article.    Ed.  note). 
They  may  be  thought  of  as  significant  changes 
of  direction  in  the  measures  of  significant  social 
institutional  variables.n 


Underiying  the  publication  is  a  time  series 
database  containing  some  thousands  of  series. 
At  some  point  in  the  futtire,  clients  will  be  able 
to  purchase  collections  of  these  series,  on 
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The  New  OED  project 
at  Waterloo: 

old  wine  in  new  bottles 


by  D.  W.  Russell' 

Waterloo  Centre  for  the  New  OED 

University  of  Waterloo 


'Presented  at  the  International  Association  for 
Social  Science  Information  Service  and 
Technology  (IASSIST)  Conference  held  in 
Vancouver,  British  Columbia,  Canada  on  Mav 
19-22.  1987 


In  mid-1983,  three  years  before  the  fourth  and 
final  volume  of  the  Supplement  to  the  Oxford 
English  Dictionary  (OED)  appeared  in 
conventional  print  form,  Oxford  University  Press 
(OUP)  had  announced  its  intention  of 
computerizing  the  OED.    By  the  end  of  1983 
the  decision  had  been  made  that  Oxford 
University  Press  would  manage  this  large 
imdertaking  itself,  and  would  enter  into  a  series 
of  contracts/agreements  with  other  parties  (such 
as  the  University  of  Waterioo,  IBM  UK, 
International  Computaprint  Corporation  (ICC)  of 
Fort  Washington.  Pennsylvania,  among  others) 
to  carr>  out  work  on  various  aspects  of  the 
computerization. 

From  a  long-term  perspective,  the  Project's 
objective  is  to  transform  both  the  original  12 
volumes  of  the  OED  and  the  4-volume 
Supplement  into  an  electronic  database;  the 
work  to  be  done  in  arriving  at  this  objective 
has  been  broken  down  into  several  discrete 
phases:  first,  the  initial  sixteen  volumes  had  to 
be  entered  into  the  computer,  preserving  the 
original  text  organization  and  presentation.    This 
meant  the  tagging  of  structural  and 
typographical  elements  in  the  dictionary  as  the 
data  were  keyed  and  transfered  onto  magnetic 
tape.    One  of  the  aims  of  this  phase  is  to 
produce  a  printed  version  of  the  dictionary, 
integrating  the  Supplement  material  with  the 
body  of  the  OED,  and  incorporating  about  4000 
new  entries  into  this  merged  edition.    This 
version  will  be  printed  in  the  spring  of  1989.  in 
a  projected  22  volimies  at  a  price  of  about 
£1,500.    The  next  major  phase  involves  the 
design  of  a  database  structure  for  the 
machine-readable  data,  so  that  alternative 
structures  can  be  presented,  and  interactive 
querying  will  be  possible.    It  is  at  this  point 
that  the  New  OED  comes  into  being,  leading  to 
expanded,  updated,  and  revised  versions  of  the 
dictionary  thai  will  allow  several  modes  of  data 
access,  ranging  from  direct,  online  access,  to  the 
conventional  printed  version,  and  special,  printed 
subsets  of  the  dictionar,'.    As  can  be  seen,  the 
possibilities  are  many,  and  the  prospects  for 
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lexicographical  work  seem  to  expand  almost 
infinitely. 

The  magnitude  of  the  proposed  task  can  be 
appreciated  when  one  considers  the  size  of  the 
current  source  data,  the  sixteen  volimies  of 
OED  and  Supplement:  there  are  over  21,000 
pages  of  three-coltimn  print,  with  a  total  of 
about  500  million  characters  including 
pimcttiation  and  spacing.    This  breaks  down  to 
about  306,000  main  entries,  163,000  subordinate 
entries,  with  over  2,350,000  illustrative 
quotations,  and  almost  half  a  million  cross 
references.    In  addition  to  the  sheer  size  of  the 
dictionar>',  the  material  has,  as  anyone  who  has 
ever  used  the  OED  knows,  an  extremely 
complex  structure,  by  which  explicit  and  implicit 
information  is  conveyed  to  the  reader  through 
both  the  layout  and  the  typography  of  the  texL 
At  the  simplest  level,  each  entry  usually 
includes  a  headword  lemma,  a  pronunciation 
key,  a  grammatical  category  label,  a  list  of 
variant  forms  given  by  century,  an  etymological 
section,  a  series  of  sense  definitions,  each  with  a 
quotation  bank;  there  is  usually  one  quotation 
per  century,  with  date,  author,  source  and 
bibliographic  reference.    This  structure  is  made 
more  complex  by  words  or  forms  that  do  not  fit 
easily  into  the  usual  categories,  and  by  the 
inconsistencies  in  method  inevitable  in  a  project 
that  spanned  many  years  and  several  generations 
of  lexicographers. 

Still,  the  capture  of  the  dictionary'  material, 
which  had  to  be  done  by  manual  keying  of  the 
text,  since  it  was  too  complex  for  optical 
scaimers,  has  been  done,  and  done  surprisingly 
successfully  over  a  period  of  18  months,  ending 
in  June  1986.    The  enoi  rate,  as  noted  by 
human  proof  readers  hired  by  Oxford,  is  seven 
or  fewer  errors  per  10,000  keystrokes.    Working 
from  enlarged  copies  of  OED  text,  the  key 
entry  operators  entered  tags  to  identify  all  the 
typographical  elements,  as  well  as  some  of  the 
structural  elements  of  the  source  text    One  of 
the  chief  aims  at  this  stage,  was,  after  all,  to  be 
able  to  produce  a  new,  typeset  edition  in  1989. 


But,  since  even  this  seemingly  routine  task 
becomes  more  complex  when  one  is  dealing 
with  as  much  material  as  is  foimd  in  the  OED 
and  Supplement,  a  further  refinement  was 
introduced  at  this  point  in  the  process. 
Computer  science  researchers  at  "Waterloo 
developed  a  parser  which  automatically  tagged 
structtiral  elements  not  tagged  dtiring  keying, 
and  which  converted  the  ICC  tags  into  SGML 
codes.'  This  parsing  permitted  the  automatic 
validation  of  the  earlier  tagging.    The  conversion 
to  SGML  codes  was  needed  to  permit  a  greater 
degree  of  automatic  integration  by  computer  of 
the  OED  and  Supplement,  as  well  as  to 
facilitate  the  lexicographical  team's  interactive 
integration  of  the  two  source  texts. 

Meanwhile,  at  the  University  of  Waterloo, 
research  on  database  design  for  the  New  OED 
has  begun,  with  the  suppon  of  a  1.3  milUon 
dollar  gran:  from  NSERC.    The  Project  has  the 
mandate  to  design  a  database  for  the  New 
OED,  and  to  develop  software  utilities  for 
database  access,  maintenance  and  update.    This 
work  is  to  be  carried  out  over  three  years  by  a 
team  of  seven  full-time  researchers,  directed  by 
a  team  of  computer  science  faculty  researchers, 
led  by  Gaston  Gonnet  and  Frank  Tompa.    To 
date,  two  prototype  software  tools  have  been 
produced,  to  allow  interactive  access  to,  and 
sophisticated  querying  of,  the  database.    These 
tools  are  named  Goedel  and  Pat.    Although  it  is 
not  within  my  competence  to  describe  any  of 
the  technical  specifics  of  these  tools,  I  would 
like  to  offer  some  examples  of  the  results 
currently  possible,  drawing  mainly  from  my  own 
research  interests  in  the  OED,  namely  the 
identification  and  smdy  of  the  Anglo-Norman 
elements  which  were  adopted  by  the  Enghsh 
language  in  the  medieval  period. 


'See  Rick  Kazman,  Structuring  the  text  of  the 
Oxford  English  Dictionary  through  firute  state 
transduction,  (M.    Math,  thesis)  Umversitv  of 
Waterloo,  1985. 
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When  the  first  tapes  of  the  OED  became 
available  at  Waterloo  in  1985,  it  was 
theoretically  possible  to  begin  searching  the 
dictionary  interactively.    In  order  to  find  £dl 
words  labelled  as  Anglo-French  or  Law  French, 
I  needed  simply  to  ask  the  computer  to  list 
occurrences  of  the  relevant  strings,  "AF"', 
"ONF",  "Law  Fr.",  etc.    At  the  time  the  data 
were  moimted  in  a  series  of  files,  which  meant 
the  queries  had  to  be  repeated  for  each  file. 
The  results  were  not  easy  to  read,  and  although 
one  could  scroll  back  or  forward  to  identify  the 
headword,  in  the  end  it  proved  easier  to  use 
the  printed  dictionary  to  locate  the  relevant 
entry. 

After  the  creation  of  Goedel  in  1986,  a  whole 
new  range  of  possibilities  made  my  querying  of 
the  data  easier.    I  could  now  ask  for  the 
extraction  of  material  according  to  structural 
categories  in  the  dictionary  entries,  and 
according  to  specific  strings  within  each 
category.    1  decided  to  limit  my  extraction  to 
listing  the  headword  lemma,  the  material  within 
the  etymological  section,  and  the  dates  of  the 
illustrative  quotations,  for  all  entries  which  could 
be  considered  to  be  derived  from 
Anglo-Norman  sources.    Within  the  structure  of 
the  OED,  Anglo-Norman  material  is  identified 
in  various  fashions  in  the  etymology  section:  it 
may  be  labelled  as  Anglo-French,  it  may  be 
labelled  as  Old  French  (or  Old  Norman  French 
or  Old  Law  French),  or  it  may  be  found  to  be 
labelled  as  French  but  with  quotation  dales 
preceding  1500.    The  results  of  my  queries 
using  Goedel  could  be  printed  in  a  formatted 
form  which  is  eminently  readable,  and  could  be 
carried  away  for  use  elsewhere,  freeing  the 
researcher  from  being  lied  to  a  computer 
terminal.    With  the  extraction  power  and 
fiexibility  of  Goedel,  the  humanist  researcher  is 
faced  with  the  new  challenge  of  creating  more 
sophisticated  queries,  based  on  previously 
unexamined  possibilities,  and  building  on  the 
results  of  his  or  her  ongoing  interactive 
research. 


The  second  extraction  tool,  Pat,  allows  a  rapid 
and  effective  querying  of  a  difTerent  sort,  based 
in  part  on  pattern  matching;  with  Pat,  for 
example,  1  could  extract  all  entries  derived  from 
AF  or  OF  and  which  have  supporting 
quotations  from  a  particular  author,  such  as 
Chaucer  or  Gower.    Or,  to  give  another 
example,  a  researcher  looking  for  infantine 
language  was  able  to  search  the  dictionary  for 
all  words  whose  sense  definition  included 
specific  key  words,  such  as  "little"  followed  by 
"boy"  or  "giri"  or  "child"  within  a  specified 
number  of  characters.    Increasing  familiarity 
with  the  results  of  these  searches  led  to  more 
sophisticated  querying  of  the  database,  and 
raised  technical  problems  which  were  addressed 
by  the  data  structuring  group  working  on  the 
Project    In  a  similar  way,  Pat  was  used  by  a 
researcher  interested  in  the  source  and 
frequency  of  quotations.    It  is  possible  to  extract 
all  quotations  from  a  particular  author,  such  as 
Fanny  Bumey,  and  further,  to  extract  only 
quotations  from  Bumey  from  a  specified  work, 
such  as  Cecilia. 

These  short  examples,  from  among  the 
preliminary  group  of  research  projects  underway 
at  the  Waterioo  Centre,  serve  to  emphasize  the 
futuristic  nature  of  the  Project;  it  is  difficult  to 
design  a  database  for  uses  which  may  arise  in 
the  future,  but  which  have  not  yet  been 
imagined  by  humanist  researchers.    In  an 
attempt  to  come  to  grips  with  this  problem, 
OUP  and  the  University  of  Waterloo  conducted 
a  user  survey,  to  find  out  how  individuals  use 
the  OED,  to  determine  the  principal  facilities 
needed  for  the  New  OED,  and  to  provoke 
considered  responses  about  applications  for  the 
electronic  version  of  the  dictionary.    Over  1,100 
individuals  were  contacted,  of  whom  60%  were 
from  the  UK  and  Europe,  40%  from  North 
America.    The  sample  included  both  academic 
and  non-academic  users,  with  an  emphasis  on 
sophisticated  users.    The  results  have  not  yet 
been  completely  analyzed,  but  preliminary 
results  give  a  broad  outline  of  what  users  would 
like  to  see  in  the  New  OED.    Basically,  most 
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users  want  everything  currently  in  the  OED,  in 
a  fashion  that  is  simple  to  use,  quick  to  give 
results,  and  cheap  to  access.    Ideally  the  data 
will  be  available  publicly  through  an  online  data 
service,  and  privately  via  some  disk  formaL 
The  software  must  be  user  friendly,  allowing 
quick  access,  both  to  a  skeleton  summary  of 
each  entry,  and  to  a  complete  entry  or  selected 
details  of  an  entry.    The  electronic  version  must 
not  be  prohibitively  expensive,  and  yet  be 
amenable  to  continuous  updating  and  revision. 
And  finally,  the  New  OED  should  continue  to 
be  published  in  printed  form.    The  survey 
results  also  suggest  future  applications,  many  of 
which  will  require  revision  of  the  OED:  these 
include  semantic  field  searches,  frequency  ratings 
by  date  or  language  of  origin,  the  use  of  the 
OED  as  a  thesaurus,  and  so  on.    One  of  the 
suggested  applications,  requiring  the  abilit>'  to 
search  phonetics  elements,  will  be  made  much 
easier  by  the  decision  to  replace  Murray's 
phonetic  transcriptions  with  transcriptions  based 
on  the  symbols  used  by  the  International 
Phonetic  Association.    This  revision  is  to  be 
incorporated  in  the  1989  print  version  of  the 
New  OED.    Searching  on  these  phonetic 
elements  will  only  be  possible,  of  course,  in  the 
electronic  version  of  the  New  OED. 

Just  what  the  electronic  form  of  the  New  OED 
will  be  has  not  yet  been  decided.    There  are 
plans  to  market  an  exploratory  electronic  issue 
of  the  old  OED.  without  the  Supplement,  and 
without  revisions,  on  CD-ROM  in  late  1987, 
with  the  aim  of  validating  some  of  the  results 
of  the  user  survey,  and  generating  feed-back  for 
the  database  design  currendy  in  progress  for  the 
New  OED.    The  database  created  to  produce 
the  print  version  of  the  New  OED  in  1989  is 
not  now  in  a  form  which  OUP  would  offer  to 
outside  users,  but  the  Project  does  expect  to 
market  an  electronic  version  of  the  1989  and 
later  editions  of  the  New  OED.    The 
undertaking  does  present  a  number  of  technical 
and  legal  problems,  the  solutions  to  which  will 
determine  the  final  end  product    And  the 
planned  revision  and  enhancement  of  the  New 


OED  after  1989  will  certainly  carry  the  Project 
well  into  the  twenty-first  century.    Judging  from 
the  unforeseen  shifts  and  changes  in  plan  which 
plagued  James  Murray  and  the  other  editors 
involved  in  the  creation  of  the  OED  from  1879 
to  1933,  it  is  perhaps  imwise  to  promise 
completion  of  the  New  OED  Project  by  any 
given  date.    But  there  is  room  for  cautious 
optimism.    Such  is  the  position  taken  by  Tim 
Benbow,  OUP's  Director  of  the  New  OED 
Project,  in  his  Status  Report  given  at  our  second 
annual  conference  at  Waterloo  in  November 
1986.    He  said:- 

"At  the  moment  the  project  is  nmning  on 
schedule  and  on  budget    We  are  not, 
however,  complacent    Rectirrent 
nightmares  —  one,  in  which  as  a  juggler 
one  is  constrained  to  keep  an  increasing 
number  of  dictionarx'  voltmies  of 
monstrous  size  in  the  air  imder  pain  of 
lexicution,  alternating  with  a  Sisyphean 
vision  of  ordering  acres  of  dictionary  slips 
only  to  have  them  taken  by  the  wind  as 
the  last  is  about  to  be  positioned  —  see 
to  that!" 

It  is,  no  doubt,  significant  that  Tim  Benbow's 
nightmares  do  not  yet  reflect  the  presence  of 
any  textual  database  monsters.n 


•Tim  Benbow,  "Status  Report  on  the  New  OED 
Project"    Paper  given  at   'Advances  in 
Lexicology",  Second  Annual  Conference  of  the 
UW  Centre  for  the  New  Oxford  English 
Dictionary,  Waterloo,  Nov.  9-11,  1986. 
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The  sampling  bias  in 
random  digit  dialing 


by  A.  Dianne  Schmidley' 
Bell  Atlantic  Corporation 


All  research  is  based  on  information  from  data 
collection  efforts,  whether  the  data  result  from 
the  use  of  qualitative  approaches,  such  as, 
content  analysis,  focus  group  sessions,  or 
in-depth  interviewing  by  psychologists,  social 
workers  or  ethnographers,  or  from  methods 
which  produce  data  more  amenable  to 
quantitative  analyses,  such  as,  sample  surveys 
and  censuses  (technically,  a  census  is  a  100% 
sample  stirvey).    A  study-,  conducted  by  Derek 
Phillips  in  the  early  1970s,  concerning  the 
preference  of  sociologists  for  either  qtiantitative 
or  qualitative  data  collection  approaches, 
revealed  that  more  than  90  percent  of  the 
research  conducted  by  those  social  scientists 
resulted  from  the  analysis  of  data  collected 
through  the  administration  of  interview 
schedules  and/or  questionnaires  developed  for 
use  in  a  survey  setting. 

While  sociologists  are  generally  viewed  as  major 
collectors  and  users  of  the  data  resulting  from 
surveys,  they  are  not  the  only  individuals 
interested  in  survey  data.    Pollsters,  advertisers, 
program  administrators  and  others  have 
important  uses  for  survey  data.    In  fact,  there 
are  several  broad  categories  of  surveys 
including: 

1.  Attitude  and  public  opinion  polls, 

2.  Marketing  research,  including  advertising  and 
public  relations  surveys, 

3.  Government  surveys,  especially  those 
conducted  for  the  purposes  of  developing 
legislation  and  administering  and  evaluating 
the  effectiveness  of  programs, 


'Presented  at  the  International  Association  for 
Social  Science  Information  Service  and 
Technology  (IASSIST)  Conference  held  in 
Manna  Del  Rev.  California  on  Mav  21-24   1986 


4.  Special  surveys  conducted  by  researchers  in 
university  settings,  usually  the  basis  of 
primary  research,  and  often  government 
funded, 

5.  Other  surveys,  including  internal 


'Knowledge  from  What? 
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organizational  surveys  conducted  to  monitor 
attitudes  and  opinions. 


Individuals  with  training  in  survey  research 
methods  direct  many  of  these  collection 
activities;  however,  many  surveys,  particularly 
those  in  the  areas  of  marketing,  public  relations 
and  opinion  research,  are  conducted  by 
individuals  who  have  no  formal  training  and  are 
not  aware  of  the  many  pitfalls  of  survey  data. 
Once  the  ill-gotten  information  is  translated  into 
copy,  it  acquires  a  life  of  its  own  and  the  'facts' 
become  almost  impossible  to  erase. 

The  use  of  data  from  a  survey,  for  some 
purpose  other  than  that  for  which  it  was 
collected,  is  common.    For  example,  data  from 
govemmeni  surveys  such  as  the  Cunent 
Population  Survey  (CPS)  are  often  used  as  the 
basis  of  marketing  decisions,  while  data  from  a 
public  opinion  survey  may  influence  the 
decisions  of  government  policy  makers. 

The  data  from  all  five  categories  of  surveys  are 
utilized  to  support  policy  decisions  affecting  the 
expenditure  of  millions  of  dollars.    Given  the 
penchant  of  policy  makers  for  basing  decisions 
on  data  collected  through  the  use  of  surveys, 
the  need  for  a  critical  appraisal  of  each  of  the 
various  stages  of  the  survey  process  has  evolved. 
Questionnaire/interviewer  schedule  design, 
sample  selection,  the  adminisuation  of  the 
survey  process,  the  collection,  tabulation  and 
interpretation  of  the  data,  and  testing  of  the 
reliability-  and  validity-  of  the  information 
collected  from  the  survey,  compared  to  some 
benchmark,  have  all  become  subareas  of  the 
survey  research  process  and  are  carefully 
scrutinized  by  trained  social  scientists. 

The  purpose  of  this  paper  is  to  shed  additional 
light  on  the  problem  of  selecting  a 
representative  sample  of  a  population  to  be 
surveyed,  using  the  procedure  of  simple  random 
sampling. 


When  a  sample  is  selected  for  a  survey, 
attention  must  be  given  to  the  parameters  of 
the  universe  the  researcher  hopes  to  study,  and 
the  method  of  eliciting  information  from  that 
universe.    In  the  case  where  the  universe  to  be 
studied  is  a  human  population,  there  are  a 
limited  number  of  ways  of  operationally 
defining  the  universe  and  collecting  information 
from  the  individuals  comprising  that  universe. 
Each  type  of  definition  will  affect  the  method 
of  sample  selection  chosen.    If  the  universe  is 
operationally  defined  by  street  address  records, 
the  researcher  can  mail  a  questionnaire  to  each 
individual  respondent    If  telephone  numbers 
constitute  the  universe,  the  researcher  can 
telephone  respondents  and  administer  a 
questionnaire/interview  schedule.    With  street 
address  records,  the  researcher  can  meet  with 
each  respondent  and  conduct  interviews  face  to 
face. 

Generally,  evaluations  of  each  of  these  three 
approaches  suggest  that  the  higher  the  quality 
of  the  data  collected  (given  proper  development 
of  the  survey  instrument),  the  greater  is  the 
expense  of  the  survey.    Mail-back  questionnaires 
are  the  least  expensive  way  in  which  to  collect 
information  and  result  in  the  poorest  quality- 
data  with  the  lowest  response  rates,  while 
in-person  interviews  are  the  most  expensive  way 
in  which  to  collect  the  information,  but  produce 
better  respondent  cooperation.    Given  this  set  of 
circumstances  most  researchers  employ  the 
telephone  as  a  means  of  collecting  survey 
information,  since  it  produces  medium  quality 
data  at  medium  cost 

I  mentioned  earlier  that  when  drawing  a  sample 
from  a  universe,  the  researcher  operationalizes 
the  sampling  process  by  giving  the  members  of 
the  populaton  s/he  hopes  to  study  concreteness 
in  the  form  of  a  street  address  or  a  telephone 
number.    Client/customer  specific  lists,  collected 
by  some  agencies  and  firms,  can  be  used  when 
the  researcher  is  interested  in  some  subset  of 
the  population,  and  thereby  has  access  to  the 
specific  names,  telephone  numbers  and/or 
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address  records  of  that  group. 

In  most  instances,  however,  the  universe  to  be 
studied  is  only  vaguely  known.    It  may  be 
comprised  of  all  the  individuals  in  some  specific 
geographic  location  such  as  a  trading  area, 
county  or  school  zone,  or  it  may  be  a  subset  of 
the  population,  such  as  households  with  a 
certain  income,  or  people  of  certain  ages  or 
ethnicity.    At  any  rate,  owing  to  a  lack  of 
specific  information  which  could  be  used  to 
contact  and  interview  the  individuals  of  interest 
in  the  population,  the  researcher  more  often 
uses  a  method  of  simple  random  sampling 
(SRS)  of  available  street  records  or  telephone 
numbers  in  order  to  delineate  those  units  which 
will  be  sampled  for  the  purpose  of  collecting 
information  which  can  then  be  ascribed  to  the 
larger  universe  from  which  the  sample  was 
drawn. 

The  preference  of  survey  researchers  for 
telephone  interviewing  as  a  means  of  collecting 
information  from  respondents,  coupled  with  the 
need  to  employ  SRS  in  order  to  to  identify  the 
individuals  to  be  surveyed  within  the  universe 
in  which  they  are  located,  has  led  recently  to 
the  development  of  a  technique  called  'Random 
Digit  Dialing'  (RDD). 

RDD  IS  believed  by  the  naive  to  be  a  cure  for 
drawing  biased  or  non-representative  samples 
from  a  universe.    Misinformed  advocates  of 
RDD  persist  in  believing  that  universal 
telephone  service  means  that  everyone  has  a 
telephone  in  their  home.    Secure  in  this  belief, 
RDD  practicioners  promote  the  idea  that  by 
simply  dialing  the  telephone  in  a  random 
manner  one  can  draw  a  random  sample  of  the 
population  of  any  given  geographic  area. 
Census  data,  which  are  used  by  experienced 
survey  researchers,  to  design  sampling  frames 
and  to  calibrate  survey  results,  indicate  that 
universal  telephone  service  does  not  mean  that  a 
telephone  is  found  in  every  housing  unit 


'Universal  telephone  service'  is  akin  to  the 
notion  of  'full  employment'.    Full  employment 
is  defined  by  many  as  the  situation  in  which 
about  5%  of  the  population  in  the  labor  force, 
ages  16  through  64,  and  desiring  employment,  is 
unemployed.    There  aie  many  persons  between 
the  ages  of  16  and  64  who  are  not  employed, 
and  are  not  seeking  employment;  so  full 
employment  does  not  mean  that  5%  of  this 
entire  age  group  is  unemployed.    Similarly, 
imiversal  telephone  service  does  not  mean  that 
every  housing  unit  contains  telephone  network 
access.    There  are  a  variety  of  reasons  why  a 
imit  may  be  without  access  to  the  telephone 
network. 

Attachment  A  to  this  paper  gives  the  reader  the 
sources  utilized  to  determine  something  called 
telephone  penetration.    As  can  be  seen  in  this 
attachment,  there  are  basically  four  ways  in 
which  to  measure  telephone  penetration:  (1) 
using  decennial  census  data,  (2)  using 
information  from  the  CPS,  (3)  using 
GBF-DIME  records,  and  (4)  comparing 
household  counts  with  counts  of  telephone 
access  lines  in  given  geographic  areas. 

Examples  of  the  decennial  census  and  CPS 
measures  of  telephone  penetration  of  each  of 
jurisdictions  served  by  Bell  Atlantic  appear  in 
Table  1,  entitled,  "Telephone  Availabilit}'  For 
Selected  Areas:  United  States  and  Bell  Atlantic 
Ser\'ed  States,  1980-1985".    Figures  1  and  2 
illustrate  changes  over  time  in  telephone 
penetration  in  these  geographic  locations.    Table 
2  provides  and  indication  of  the  type  of  statistic 
that  can  be  developed  using  the  Access 
Lines/Households  ratio. 

Several  conclusions  are  possible  from  a  careful 
examination  of  these  various  measures  of 
telephone  penetration: 

1.   Data  from  different  questions  result  in 
different  statistics;  i.e.,  the  two  censuses 
produced  different  results  partly  owing  to 
question  wording.    These  differences. 
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although  found  in  a  comparison  of  data  from 
the  1970  and  1980  censuses,  are  illustrated  in 
Table  1  in  the  'unit'  and  'availability' 
meastires  from  the  CPS, 

2.  In  addition  to  these  difTerences,  however, 
there  are  major  difTerences  between 
geographic  locations  (Figtires  1  and  2)  and, 

3.  Differences  have  occurred  over  time  (Table 
2), 

4.  DifTerences  result  from  the  sample  selection 
process.    Table  la  reveals  the  differences  in 
sample  drawn  from  1980  census  data 
(Colunms  a  and  b);  Table  lb  illustrates  the 
range  associated  with  each  confidence 
interval  computed  for  the  various  point 
estimates  of  penetration  computed  from  CPS 
data. 


Thus  fai,  we  have  examined  the  penetration 
rate  for  the  total  nimiber  of  households.    When 
differences  in  ty-pes  of  households  is  taken  into 
account,  the  variation  in  telephone  penetration 
becomes  even  greater.    The  data  in  in  Table  3 
are  from  a  1980  Census  Public  Use  Microdata 
Sample  (PUMS).    Table  3  contains  a 
cross-tabulation  of  households  by  age  of 
householder,  household  income,  and  telephone 
penetration  for  the  state  of  Virginia.    These  data 
indicate  that  the  presence  of  a  telephone  in  the 
housing  unit  is  not  a  random  event,  but  rather 
thai  one  is  less  likely  to  have  a  phone  if  one  is 
poor  and/or  young. 

Figure  3  illustrates  the  ratio  of  telephone  access 
lines  to  households  in  the  Bell  Atlantic  region 
from  the  early  1950s  to  the  1990s  forecast 
period.    A  surprising  event  occurred  in  the  late 
1970s:  the  number  of  access  lines  increased  to 
the  point  of  outnimibering  households.    How 
can  this  be?    The  answer  is  quite  simple,  it's 
called  multiple  lines.    In  the  state  of  New 
Jersey  alone,  more  than  10%  of  households  have 
ai  least  two  lines,  and  these  households  are  not 


poor  and  they  are  not  yoimg. 

Can  one  draw  a  random  sample,  utilizing  RDD? 
Yes,  but  a  random  sample  of  what?    Certainly 
one  will  not  draw  a  random  sample  of  the 
households  in  a  given  geographic  location.    One 
will  draw  a  random  sample  of  telephone  hnes 
and,  as  we  have  seen,  these  are  not  evenly 
distributed  across  the  population. 

Does  this  matter?    It  depends  on  what  one  is 
trying  to  accomplish.    A  pollster,  trying  to 
predict  an  election,  will  want  to  examine  age 
specific  voting  patterns  and  calibrate  these  to 
the  universe  of  households  with  telephones, 
allowing  for  multiple  lines,  of  course.    Since  the 
people  who  vote  tend  to  be  those  with  higher 
incomes,  and  voter  participation  increases  with 
age,  you  will  reach  the  group  you  wish  to 
sample  using  telephone  interviewing,  although 
you  raa\  overstate  the  case,  especially  if  you  are 
ttacking  Republican  candidates.    (Table  4) 

A  marketing  researcher,  must  be  careful  to 
calibrate  the  resulting  data  with  census  data, 
because  the  upscale  population  will  be 
over-represented  and  other  groups 
imder-represented  in  the  sample.    An  academic 
or  government  researcher,  conducting  a  survey 
designed  to  produce  information  for  the 
development  of  legislation  or  the  operation  of  a 
social  program,  would  need  to  calibrate  the  data 
with  census  or  CPS  data  in  order  to  enstiie  a 
representativeness. 

In  conclusion,  it  seems  highly  unlikely  that 
telephone  interviewing  is  a  reasonable 
replacement  for  the  Bureau  of  the  Census 
traditional  decetmial  census  methods  of  data 
collection  employing  a  housing  address  hst, 
mailout  questionnaires  and  enumerators.    Given 
the  problems  associated  with  the  universe  of 
telephone  access  lines,  if  telephone  interviewing 
becomes  the  mainstay  of  data  collection  for  the 
U.S.    Government,  we  will  have  lost  one  of  the 
most  important  means  we  have  of  analysing  the 
composition  of  our  population,  n 
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Attachment  A 


METHODS  OF   MEASURING  TELEPHONE 
AVAlLABlLlTr/PESETRATION   (STATE   LEVEL) 


I.     Decennial  Census  of   Population  and  Housing 
Questionnaire 


1970  IHl       Is  there  a  telephone  on  which  people   living  in  your  quarters 
can  be  called? 


I    I     yes  what    is   the  number? 

Phone  number 

□  no 

1980  fHZd     Do  you  have  a  telephone  in  your  living  quarters? 

□  yes  n  "° 

Caveats:  *  Timliness  of  the  data 

•  Validity  I   Reliability  issues 

II.  Current  Population  Survey 

Interview 

Supplementary  questions  regarding  telephone  availability 
asked  in  March,  July  &  November  of  each  year  of  a  state  based 
sample  of  households 

Caveats:  *  Sampling  error  at  state  level 

*  Coverage  of  non-Bell  Atlantic  served  areas 

III.  Geobased  Files/Dual  Independent  Map  Encoding 

Matching  telephone  customer  address  records  with  street  address  records 
in  the  GBF 

Caveats:  *  Non-urbanized  areas  not  covered 

*  New  housing  not  covered,  although  it  can  be 
inferred  from  telephone  company  records 

IV.  Residence  Acess  Lines/Household  Estimates; 

Use  telephone  company  access  line  information  in  the'  numerator  and 
household  estimates  prepared  by  company  demographer  in  the  denominator 

Caveats:  *  Second  line  development;  failure 
of  business  office  to  identify 
second  line 

•  Accuracy  of  household  estimates; 
OK  for  internal  purposes,  may  not 
stand  up  in  legal/regulatory  setting 
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Table   1-A 
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Table   1-B 


Confidence  Intervals  Associated  With  Point  Estimates 
Of  Telephone  Penetration  for  Bell  Atlantic  Served  Jurisdictions 


Prepared 
Demogr 

by  D.  Schmidley 
aphic  Studies 
x8638 

(1) 

State 

(2) 
Unit  Measure 
Nov.  1985 

95.6 

(3) 

CVx 

(4) 
SEx 

1.42 

(5) 

Range 

68^ 

(6) 
Range 
95+  % 

DC 

0.0149 

94.2  -  97.0 

92.8 

-   98.4 

DEL 

93.4 

0.0119 

1.11 

92.3  -  54.5 

91.2 

-  95.6 

MD 

95.3 

0.0120 

1.14 

94.2  -  95.4 

93.0 

-  97.6 

NJ 

94.1 

0.0093 

0.88 

93.2  -  95.0 

92.3 

-  95.9 

PA 

95.8 

0.0059 

0.57 

95.2  -  95.4 

94.7 

-  96.9 

VA 

92.0 

0.0156 

1.44 

90.6  -  93.4 

89.1 

-94.9 

WVA 

65.1 

0.0183 

i.58 

84.5  -  87.7 

82.9 

-89.3 

Formula  from  U.S.  Bureau  of  Census 
SE  ^  X  •  CV 

X  X 

XI   =   68%  Confidence    Interval 
X2  =   95%  Confidence    interval 

SE     =   Standard  error 

X       =   Penetration  rate 
CV     ^   Coefficient   of   variation 
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Table  1-C 


Tins   tdble   is    from    the   U.S.    Bureau   of   the   Census        Current    Population   Surve/ 

FERlENTAGE    Uf-     HCJUSLHQLDS    WITH    A    TELLFHUNL    E-Y    HQUBcHOLDEK    S    Ail 


TABLE  I  C 


ALL  RACES 

WHITE 

bLAC^ 

HISF 

-iTJ 

Ur.i  t 

Ava 

1 

Ur.lt 

Ave 

1 

Unit 

Ava 

1 

Ur. 

t. 

8  MQNIH 

AVERAGE 

TOTAL  H0USEHULD5 

91  .7 

9r> 

B 

9r.  7 

95 

,;j 

B'j.  4 

B4 

B 

Bl 

1 

16-LM  YKS  OLD 

77.4 

B- 

6 

80.0 

85 

6 

57.  9 

69 

7 

62 

□ 

2^-54  YRS  OLD 

91  .B 

93 

B 

97.5 

95 

1 

B.J .  2 

84 

5 

62 

7 

S5-5'9  YRS  OLD 

94.9 

96 

1 

96.0 

96 

9 

87.2 

89 

6 

87 

4 

60-64  YRS  OLD 

95.  1 

96 

0 

96.0 

96 

B 

B7.9 

B9 

8 

B8 

a 

65-69  VRS  OLD 

95.9 

96 

7 

96.  9 

97 

5 

87.9 

90 

2 

90 

4 

70-99  YRS  OLD 

95.5 

96. 

6 

96.  1 

97 

-) 

89.  1 

91 

3 

Bl, 

8 

NOVEMBER  83 
TOTAL  HOUSEHOLDS 
YRS  OLD 
YRS  OLD 
YRS  OLD 
60-64  YRS  OLD 
65-69  YRS  OLD 
70-99  YRS  OLD 


16-24 
25-54 
55-59 


91  . 
76. 
91. 
95. 
95. 


93 

7 

93 

1 

95 .  0 

78. B 

83. 9 

BCi 

84 

1 

80 

2 

86.  2 

49.9 

68.2 

64 

93 

7 

93 

4 

95.  2 

78.7 

83.3 

81 

96 

1 

96 

1 

97.0 

86. 3 

88.5 

89 

96 

4 

96 

4 

97.  2 

89.5 

90.7 

87 

96 

2 

96 

5 

97.0 

87. 2 

89.0 

90 

96 

5 

96 

0 

97.0 

90.  1 

92.3 

85 

85. 
89. 
90. 


MARCH  B4 

TOTAL  HOUSEHOLDS 

91 

.8 

93 

.6 

93 

.  3 

94.9 

80.  1 

B4.  1 

80 

.7 

83.6 

16-24  YRS 

OLD 

77 

.6 

84 

. ':i 

80 

.  3 

85.5 

57.9 

71.5 

59 

.  0 

66.  2 

25-54  YRS 

OLD 

91 

.9 

93 

.  7 

93 

.  5 

95.0 

BO.  4 

84.0 

83 

-> 

85. 6 

55-59  YRS 

OLD 

94 

.9 

95 

.9 

95 

.7 

96.6 

87.6 

B9.9 

88 

.  7 

90.5 

60-64  YRS 

OLD 

94 

2 

95 

.  -^ 

95 

9 

96.7 

81.7 

B5.0 

67 

4 

89.6 

65-69  YRS 

OLD 

96 

1 

96 

6 

97 

0 

97.4 

87.8 

69.  3 

85 

8 

87.8 

70-99  YRS 

OLD 

95 

3 

96 

-^ 

96 

2 

97.  1 

87.2 

88.8 

82 

= 

85.5 

JULY  84 

TOTAL  HOUSEHOLDS 

91 

6 

93 

8 

93 

2 

95.0 

80.5 

85.3 

81 

1 

64. o 

16-24  YRS 

OLD 

77 

0 

83 

3 

79 

4 

85.3 

60.4 

70.0 

62 

9 

7.j.a 

25-54  YRS 

OLD 

91 

7 

93 

8 

93 

4 

95.  1 

79.8 

84.9 

83. 

1 

65.6 

55-59  YRS 

OLD 

95 

1 

96 

3 

96 

1 

97.  I 

87.5 

90.2 

87. 

4 

91  .  4 

60-64  YRS 

OLD 

95 

0 

96 

2 

95 

8 

96.9 

87.7 

69.5 

BB. 

1 

9.;. .  5 

65-69  YRS 

OLD 

96 

4 

97 

1 

97 

3 

97.9 

B9.3 

91.3 

66. 

7 

90.6 

70-99  YRS 

OLD 

95 

- 

96 

^ 

95. 

9 

96.9 

89.6 

93.  1 

64. 

0 

68.5 

NOVEMBER  84 

TOTAL  HOUSEHOLDS 

91. 

4 

93. 

6 

93. 

1 

95.0 

78.9 

84.0 

61. 

1 

64.5 

16-24  YRS 

OLD 

76. 

1 

83. 

4 

79. 

0 

85.4 

56.  3 

70.8 

60. 

8 

70 

a 

25-54  YRS 

OLD 

91. 

4 

93. 

6 

93. 

3 

95.  1 

78.5 

83.3 

83. 

1 

85 

B 

55-59  YRS 

OLD 

94. 

9 

96. 

2 

96. 

3 

97.5 

84.7 

87.4 

85. 

3 

88 

z 

60-64  YRS 

OLD 

95. 

6 

96. 

5 

96. 

5 

97.3 

90.  3 

92.  1 

86. 

ij 

67 

2 

65-69  YRS 

OLD 

96. 

0 

96. 

7 

97. 

1 

97.6 

86.7 

69.  1 

96. 

2 

96 

2 

70-99  YRS 

OLD 

95. 

3 

96. 

6 

96. 

1 

97.2 

88.0 

90.7 

87. 

1 

BB 

0 

MARCH  85 

TOTAL  HOUSEHOLDS 

16-24  YRS  OLD 

25-54  YRS  OLD 

55-59  YRS  OLD 


91  .6 
77.  3 


95 .  0 

80.  1 

64.4 

81.2 

EJ4.  1 

84.8 

59.6 

70.0 

62.4 

67.  1 

95.2 

79.5 

83.9 

83.0 

b:,.z 

96.7 

67.3 

89.  1 

86.5 

B9.  1 
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Table   2 


•a   OF  HOUSEHOLDS  WITH  TELEPHONES 
IN  THE  C&P  SERVED  AREA  OF  VIRGINIA 


Year 

(1) 

Residence 
Access 

Lines-ln 
Service 

(June  30) 
(2) 

Bell  Served 

Households 

(July  1) 

(3) 

Telephone 

Penetration 

(2)/(3)X100 

(4) 

1960 

564.824 

763.000 

74.0 

1961 

588.783 

782.000 

75.3 

1962 

616.381 

802,000 

76.9 

1953 

640,932 

823,000 

77.9 

1964 

669.764 

843,000 

79.5 

1965 

702,113 

853.000 

81.4 

1966 

736.798 

882,000 

83.5 

1957 

776.020 

904,000 

85.8 

195B 

819.205 

933,000 

87.8 

1969 

860.518 

975,000 

88.3 

1970 

896.711 

1,011,000 

88.7 

1971 

933,814 

1.027,000 

90.9 

1972, 

972.258 

1.058.000 

91.0  ~ 

1973 

1.025.160 

1.104.000 

92.9 

1974 

1,069.475 

1,134.000 

94.3 

1975 

1.097.747 

1,158.000 

94.0 

1976 

1,132,060 

1.202,000 

94.2 

1977 

1,166,408 

1.225,000 

95.2 

1978 

1,212,158 

1.263,000 

96.0 

1979 

1,255,582 

1,305,000 

96.2 

1980 

1,288,870 

1,355,000   • 

95.1 

1981 

1,319.998 

1.386,000 

95,2 

1982 

1,343,961 

1,416.000 

94.9 

Source:   (1)  Data  for  Residence  Access  Lines-In-Service  are  taken  from  the  Virginia 
monthly  No. 7  report,  years  1960-80,  and  taken  from  the  Virginia 
No.  2705  report  for  years  1981-82; 

(2)  Household  data  is  based  on  household  counts  from  the  Decennial 

Census  1960,  1970  and  1980;  and  are  based  on  the  Current  Population 
Survey  and  P-25  Population  Reports  in  other  years.  These  reports 
originate  at  the  U.S.  Bureau  of  the  Census. 

October  19£2 
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Table  3 


T(le»io«t 

Penetration  Dy  Hoe  and  Incow,   Viroinia,    l96t 

16  to 

24 

25  to 
29 

3«  to 
34 

35  to 
39 

48  to 
44 

45  to 

49 

58 

to 

54 

55  to 
59 

be  to 

iA 

£5  to 
69 

78  to 
74 

75  to 
79 

riSriLKR 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

HSHLDER 

mSKJES 

HS-1J1ES 

nousenoic  Ircoae 
Less  Tnan  ilB.eee 

t.ii 
8.79 
e.83 
e.92 

e.93 
e.9i 
i.ee 
e.94 
e.95 
e.98 
e.7a 

(.75 
e.67 
8.92 
8.9t 
8.9i 
8.98 
8.9i 
8.97 
8.99 
t.Si 
1.69 

8.77 
8.68 
8.94 
8.97 
8.98 
8.96 
8.96 
1.08 
8.99 
1.88 
8.92 

8.79 
8.68 
8.94 
8.97 
8.99 
8.99 

i.ea 

8.95 
8.99 
8.99 
8.94 

8.76 
8.69 
8.94 
8.98 
8.96 
8.99 
8.96 
1.88 
8.95 
8.95 
8.94 

8.78 
8.98 
8.94 
8.97 
8.99 
8.99 
8.99 
1.88 
8.99 
8.99 
8,94 

8.68 
8.98 
8.95 
8.96 
8.97 
8.99 
8.99 
8.95 
1.88 
8.99 
8.94 

8.63 
8.94. 
8.97 
8.96 
8.99 
8.99 
8.99 
8.96 
1. 88 

i.e« 
8.95 

8.66 
8.95 
8.96 
8.96 
8.99 
8.99 
l.»« 
8.99 
8.96 
8.95 
8.94 

8.69 
8.97 
8.97 
8.97 
8.96 
8.96 
8.95 
1.88 
1.88 
8.96 
8.94 

8.9e 

8.97 
8.96 
8.99 
8.99 
8.99 
1.88 
1.88 
8.97 
8.99 
8.93 

8.91 

ili.m  -  H9S9 

8.97 

IIMM  -  19999 

e.95 

»2«M  -  £4999 

8.99 

$e5«ee  -  29999 

8.98 

am  -  34999 

8.96 

I35W!«  -  39999 

8.97 

»4»W  -  44999 

1.88 

»45eee  -  49999 

1.88 

»5«e8e  or  (tore 

8.97 

Total 

8.93 

data 
S 


MC  data  bst  generated   from   the   1980  Public  Use  Microdata  Sample  by  the  Pennsylvania 
tate  Data    C;nter,    at  the  request  of  A.D.    Schmldley,    Bell  Atlantic 
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Table  4 


TELEPHONE  PENETRATION 

BY  AGE  OF   HOUSEHOLDER   (%) 

(1980  CENSUS) 


AGE  RATE  AGE  RATE 


18-2A 

83.8 

25-29 

92.0 

30-34 

94.5 

35-39 

95.3 

AO-44 

95.5 

45-49 

95.8 

50-54 

96.0 

55-59 

96.2 

60-64 

96.2 

65-69 

96.0 

70-74 

96.2 

75* 

95.6 

■«»is  data  was  generated  from  the  Public  Ut.e  Microdata  Samples  FUM3 
for  the  Bell  Atlantic  served  jurisdictions  of  Washington,  D.C.,  Delaware, 
Maryland,  New  Jersy,  Pennsylvania,  Virginia  and  West  Virginia,  combined. 
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Figure    1 
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Figure   2 
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Figure   2 
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Evaluation  criteria  for 

the  selection  of 

computer  mapping 

systems 


by  Marci  L.  Belcher' 

Marketing  Manager 

MPSI  Systems,  Inc. 

650  California  Street,  31st  Floor 

San  Francisco,  CA  94108 

(415)  434-3222 


Significant  differences  exist  between  the 
computer  mapping  products  available  in  today's 
market    The  way  in  which  data  are  managed 
and  manipulated  by  a  system,  in  which 
information  is  represented  in  the  mapped  end 
product,  the  resources  required  for  use,  and  the 
level  of  service  provided  by  the  vendor,  are  all 
important  criteria  that  distinguish  one  product 
from  another.    Ultimately,  the 
computer-mapping  system  selected  should  satisfy 
the  application  needs  and  environment  of  an 
organization.    However,  identifying  the 
decision-making  criteria  to  use  in  the  selection 
process  can  be  difTicult  unless  one  is  aware  of 
features  available  across  products,  and 
imderstands  how  diese  translate  into  potential 
benefits  or  problems. 

Computer  mapping  systems  use  data  as  raw 
material  in  order  to  present  relationships  in  a 
visual,  geographic  form.    Therefore,  a  mapping 
system  designed  to  truly  support  market  analysis 
should  be  capable  of  organizing  and  managing 
all  the  information  required  for 
decision-making.    One  of  the  first  issues  to 
address  in  selecting  a  mapping  product  then  is 
whether  a  mainframe  computer  or 
microcomputer-based  system  is  reqtiired. 

Centralized  organizations  with  large  varied  data 
files  on  customers,  demographics,  lifestyle 
profiles,  competitive  informaton.  consumption 
patterns,  performance,  etc.,  should  consider  a 
mainframe  based  product    Mainframe  based 
systems  are  designed  to  process  large  data  files 
efficiently,  execute  tasks  quickly,  and  perform 
more  complex  fimctions.    The  cost  of  mainframe 
based  products  are  higher  than  micro  based. 
However,  most  of  these  systems  help 
orgamzauons  optimize  their  investment  in  data 
by  bringing  the  information  to  multiple  users 
for  different  applications.    Mainframe  computers 
also  tend  to  provide  more  powerful  analytical 
capabilities,  increasing  the  utility  of  maps  in 
complex  problem-solving. 


'Presented  at  the  IASSIST  86  Annual 
Conference  and  Workshops,  May  23,  1986 
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Decentralized  or  smaller  organizations  may  find 
that  microcomputer  based  mapping  systems  ofTer 
some  of  the  capabilities  of  mainframe  packages 
at  a  lower  cost    Each  micro  workstation  is  an 
autonomous  unit  in  which  the  data  base  may  be 
designed  and  enhanced  on  an  individual  basis. 
Packages  that  interface  easily  with  spreadsheet 
or  reporting  programs  allow  users  to  build  a 
data  base  for  mapping  that  includes  information 
from  a  variet}'  of  sources.    The  greatest  issue 
related  to  selection  of  a  micro  based  system  is 
the  limit  to  the  amoimt  of  data  that  can  be 
stored  and  used  at  one  time.    New  laser  disk 
storage  has  provided  users  with  access  to  a  large 
volimie  of  vendor  supplied  data.    However, 
these  data  cannot  be  integrated  with  other 
information  so  that  relationships,  for  example, 
between  a  company's  retail  performance  and  the 
corresponding  demographic  profile  of  that  sales 
area,  can  be  viewed  on  the  same  map. 

Mapping  systems  index  data  to  geographic  imits. 
Products  differ  greatly,  however,  in  the  number 
of  geographic  types  to  which  information  may 
be  linked  and,  therefore,  the  types  of  data 
which  may  be  represented  in  maps.    All 
mapping  systems  display  information  through 
areas  such  as  census  tracts  or  postal  zones. 
Mapping  systems  that  also  index  data  to 
locations  and  networks  provide  users  with  a 
higher  level  of  analytical  and  interpretive 
capabilit>-.    A  retail  location's  performance, 
product  mix  or  pricing  may  be  thematically 
represented  at  different  outlets.    Traffic  counts, 
type  of  roadway  or  directional  information  may 
be  shown  by  varied  transportation  networks.    A 
system  that  represents  information  linked  to  two 
or  three  geographic  layers  in  the  same  map 
allows  users  to  evaluate  performance  at  a  retail 
outlet  in  the  context  of  the  underiying  market 
demand,  the  success  and  spatial  location  of 
competitors,  and  the  level  of  traffic  fiow  near 
the  facility'. 

The  structure  of  a  system's  data  base  can  limit 
the  flexibility-  the  which  the  information  may  be 
used.    In  systems  with  parallel  or  hierarchical 


structures,  information  requests  or  queries  must 
conform  to  the  way  in  which  data  are  organized 
in  the  data  base.    With  a  relational  structure, 
any  data  item,  be  it  a  map  element  or  an 
attribute  in  a  related  table,  serves  as  an  entry 
point  or  "front  door"  to  the  entire  data  base. 
Every  attribute  related  to  an  individual 
geographic  unit  is  also  linked  to  all  others.    As 
a  result,  questions  may  be  based  on  decision 
criteria  or  rules,  in  a  way  that  replicates  the 
human  thought  process. 

The  benefit  of  a  relational  data  base  structure  is 
that  it  may  be  possible,  if  the  software  product 
is  appropriately  designed,  to  query  the  data  base 
using  multiple  criteria  related  to  the  attributes 
of  geographic  areas.    For  example,  the  user  may 
want  to  see  only  census  tracts  that  have  median 
home  values  greater  than  $100,000  in  which  he 
has  less  than  a  15%  customer  penetration  level. 
A  solution  set  of  only  the  units  meeting  these 
criteria  is  formed. 

These  units  may  be  further  color  segmented  by 
another  variable  such  as  percentage  of 
population  between  the  ages  of  20  and  34  years 
of  age.    Geographic  units  that  meet  a 
first-order  set  of  criteria  are  first  identified. 
These  tmits  may  then  be  automatically 
segmented  by  variables  that  support  more 
concrete  decision-making.    This  type  of  system 
performs  the  work  that  multiple  overlays  of 
single  variable  maps  ultimately  achieve,  but  with 
greater  efficiency  and  speed. 

A  major  problem  in  data  analysis  has  been  the 
inability  to  integrate  data  that  did  not 
conespond  to  common  units  of  geography. 
Often  data  important  for  planning  are  excluded 
because  they  are  not  comparable  to  other 
information  in  a  common  framework.    This 
problem  is  solved  with  a  computer  mapping 
system  that  integrates  data  through  its  inherent 
common  element  -  geography.    Disparate 
geographic  tmits  are  merged  through  an  overlay 
process  in  the  system,  producing  solutions  to 
queries  that  are  made  up  of  the  imique 
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geographic  segments  that  meet  all  imposed 
criteria.    Resulting  geographic  units  may  then 
transcend  standard  geography  and  consist  of  the 
intersections  of  units  such  as  zip  codes  and 
census  tracts. 

Map  overlay  eliminates  the  need  to  continually 
geocode  data  to  a  consistent  geographic  unit 
Customer  information  linked  to  zip  codes  may 
be  analyzed  directly  with  demographic 
information  based  on  census  tracts.    In  addition, 
if  the  overlay  process  includes  two  or  three 
types  of  geography  (areas,  networks,  locations), 
relationships  between  information  based  on 
disparate  geographic  types  may  be  studied.    For 
example,  only  retail  locations  with  poor 
revenues  that  are  also  in  areas  with  high 
potential  demand  for  the  products  or  services 
offered  could  be  mapped.    The  time  consumed 
in  map  interpretation  is  greatly  reduced  because 
a  key  problem  or  situation  is  directly  identified 
through  the  query.    In  maps  with  every  area 
and  location  represented,  a  labor  intensive  and 
often  non-methodical  search  process  must  be 
conducted  to  identify  relevant  areas. 

Often  data  are  not  stored  in  the  form  most 
useful  for  analysis.    Some  products  feature  the 
ability  to  perform  mathematical  operations  using 
attribute  items  from  the  data  base  in  any  valid 
arithmetic  expression.    New  data  items  formed 
may  be  used  immediately  in  a  query  without 
making  permanent  modifications  to  the  data 
base.    For  example,  raw  counts  of  quarteriy 
sales  performance  can  be  changed  to  percent 
growth.    Demand  estimates  could  be  developed 
by  applying  an  equation  such  as  (target 
population  X  percent  expected  conversion  X 
average  dollar  sale). 

As  an  aid  in  selecting  meaningful  data 
categories  for  segmentation,  some  systems 
display  the  distribution  and  range  of  values  for 
each  variable.    Products  may  also  automatically 
create  categories  with  even  counts  of  data,  or  let 
users  select  breakpoints,  and  automatically 
didplay  the  resulting  histogram.    Histogram 


displays  may  represent  the  distribution  of  a 
variable  on  an  individtial  or  cumulative  basis  by 
the  number  of  geographic  units  or  by  the  size 
of  the  area  in  each  category. 

Representing  raw  coimts  or  even  percentages  of 
data  in  maps  often  leads  to  the  misinterpretation 
of  information.    Systems  that  perform  density 
calculations  help  users  understand  that  even 
though  a  large  zip  code  region  has  a  large 
population,  that  population  may  be  much  less 
concentrated  than  in  a  small  region  with  a 
smaller  number  of  residents.    Through 
dot-density  mapping,  the  concentration  of  data 
may  be  viewed  on  a  macro-level. 

The  ability  of  a  system  to  zoom-in  and  increase 
the  resolution  of  map  elements  before  plotting 
is  important  in  applications  which  include  the 
study  of  locations  or  street  networks  that  may 
reside  in  close  proximit)-.    Retail  bank  branches 
are  often  located  on  the  four  comers  of  one 
intersection.    Displaying  all  bank  offices  in  one 
county  would  mean  that  many  locations  would 
be  plotted  on  top  of  one  another.    The  ability 
to  increase  the  resolution  should  be  contrasted 
with  what  many  systems  accomplish  when  they 
enlarge  a  screen  display.    It  is  the  difference 
between  blowing  up  a  photograph,  to  enlarge 
elements,  and  using  a  zoom-lens  so  that  greater 
detail  is  actually  captured. 

Managers  under  pressure  to  make  fast  decisions, 
or  overloaded  with  work,  can  quickly  test 
hypotheses  in  systems  with  on-line  interactive 
planning  capabilities.    Computer  screen 
geographic  solutions  are  immediately  produced 
and  may  be  scanned  for  important  relationships. 
An  entire  nation  may  be  analyzed  using  broad 
criteria,  then  each  area  of  interest  seen  in 
greater  detail  using  stricter  criteria  on  smaller 
geographic  units.    A  systematic  macro  to  micro 
anal  ysi tic  framework  helps  pinpoint  target  areas 
for  further  study  in  a  highly  efficient  maimer. 

The  extent  to  which  maps  are  used  to 
understand  spatial  relationships  and  distance  will 
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be  an  important  criterion  in  deciding  whether 
the  system  reqtiired  should  produce 
canographically  accurate  maps.    Many  micro 
based  products  today  create  a  graphic  image  of 
map  elements  but  do  not  relate  these  elements 
to  a  projection  system  or  scale.    Cartographically 
acctirate  mapping  systems  usually  display 
longitude  and  latitude  "tick"  marks  and  present 
a  scale  for  use  in  calctilating  exact  distances 
between  specific  loctaions. 

Mapping  systems  that  summarize  information  in 
rings  around  a  location,  within  a  bank  of 
geography,  by  solutions  sets,  or  in  various  other 
shapes  give  insight  into  the  specific  composition 
of  target  markets.    Systems  that  integrate 
information  from  various  data  sotirces  are  able 
to  produce  reports  on  a  full  spectrum  of 
information  about  areas.    Some  reporting 
functions  enable  users  to  specify  custom  layouts, 
others  rely  on  defaiilt  parameters.    Packages 
with  a  high  degree  of  data  maniptilation  and 
calculation  capability  eliminate  the  need  to 
transfer  data  for  this  purpose  to  other  software 
packages. 

Another  feattire  to  consider  in  computer 
mapping  systems  is  the  abilitv'  to  display  data 
items  in  the  data  base  near  the  corresponding 
geographic  elements.    These  items  are  not 
necessarily  part  of  a  quer,',  but  provide 
additional  detail  about  specific  areas,  locations, 
or  networks.    For  example,  the  average  family 
size  and  lifestyle  profile  label  could  be  printed 
in  areas  that  are  also  being  color-segmented  b\' 
sales  performance. 

Computer  mapping  systems  differ  greatly  in  the 
amount  of  attention  that  has  been  paid  to 
information  display  on  finished  maps.    Hatch 
and  cross-hatch  patterns  are  available  for 
plotting  in  a  variety-  of  angles.    Multiple 
symbols  and  patterns  such  as  dashes  and  double 
lines  may  be  used  to  segment  locations  and 
road  networks.    Text  for  title  and  legend  boxes 
may  be  automatically  developed  by  the  software 
and  displayed  in  a  selection  of  font  styles  and 


sizes  anywhere  sunounding  the  map  image. 

To  the  extent  that  it  is  possible,  the  colors  and 
patterns  selected  for  terminal  displays  should   . 
extend  directly  to  map  specifications  so  that  this 
information  does  not  have  to  be  re-entered. 
Systems  with  a  direct  interface  to  off-the-screen 
color  printers  exactly  replicate  a  saeen  image 
on  paper.    This  method  of  map  production  is 
the  easiest  and  fastest,  displaying  first  on  the 
computer  screen  exactly  what  will  be  produced 
in  the  printed  map. 

The  degree  of  user  friendliness  may  mean  the 
difference  between  whether  the  computer 
mapping  system  becomes  a  successful  and  often 
used  decision  support  tool.    Products  should 
include  on-line  help  programs,  fully  prompted 
input  screens,  and  English-like  syntax. 
Familiarit>-  with  a  programming  language  should 
not  be  required.    It  should  be  possible  to  save 
and  recall  for  editing  queries  and  plotting 
specifications.    Finally,  a  system  should  be 
ergonomically  easy  to  use  by  individuals  with  a 
wide  range  of  functions  and  skill  levels. 

Most  companies  offering  computer  mapping 
packages  also  supply  geographic  files.    Vendors 
may  specialize  in  providing  highly  detailed  map 
files  that  include  features  such  as  cities,  parks, 
airports,  lakes,  rivers,  bridges,  rail  and  rapid 
transit  lines  as  well  as  multi-layered 
transportation  networks.    These  features  add  a 
significant  level  of  detail  that  reduce  or 
eliminate  the  need  to  use  base  maps  for  further 
reference. 

A  product  which  includes  the  capability  of 
digitizing  and  encoding  files  will  allow  the  user 
to  add  or  modify  map  elements  on  an  ongoing 
basis.    This  may  be  a  particularly  desirable 
feature  if.for  example,  the  market  typically 
under  study  has  rapidly  changing  elements  such 
as  retail  locations,  or  if  an  organization's  sales 
regions  or  other  administrative  units  frequently 
change. 


Fall/Winter   1987 


62  - 


iassist    quarterly 


The  ease  with  which  the  tabular  data  and  map 
files  in  a  system  may  be  modified  is  important 
if  the  system  is  to  be  continually  enhanced. 
Information  such  as  sales  performance  may  be  a 
desired  addition  on  a  qimrterly  basis.    It  should 
also  be  possible  to  easily  add  additional 
geographic  areas  and  units  to  the  system,  with 
programs  linking  new  data  to  the  corresponding 
map  files.    Learn  whether  a  major 
reorganization  of  sales  regions,  or  additions  in 
product  line,  would  require  restructuring  of  the 
data  base,  and  how  easily  this  is  accomplished. 

Turnkey  systems,  which  integrate  software  and 
hardware  into  a  readily  usable  package,  are 
convenient  but  rarely  allow  custom  modifications 
that  can  make  a  system  much  more  valuable  to 
a  specific  organization.    For  example,  a  system 
that  cannot  integrate  proprietary  company 
information  or  data  from  other  vendors  offers 
limited  scope. 


toll-free  telephone  assistance. 

Vendors  confident  the  functionality  and  benefits 
of  their  product  may  offer  test  or  pilot  projects 
to  enable  companies  to  gain  hands-on 
experience  at  little  or  no  cost    Taking 
advantage  of  these  programs  will  promote 
confidence  that  the  system  selected  is  the  right 
one  for  the  application  needs  and  structure  of 
your  organization,  n 


Include,  in  the  cost  of  a  system,  the  cost  of 
other  software  required  for  operation  either  on 
a  micro  or  mainframe  computer.    Additional 
hardware  may  also  be  required  such  as  graphics 
boards,  emulator  cards,  terminal  controller, 
modem,  and,  of  course,  plotters  and/or  printers. 
For  mainframe  products,  it  will  be  important  to 
understand  memory  (region)  and  disk  storage 
requirements. 

Selecting  a  vendor  who  bases  his  success  on  the 
satisfaction  of  his  customers  is  an  important 
ingredient  in  selecting  the  right 
computer-mapping  product    Vendors  with 
experienced  personnel  in  your  industry'  or 
function  may  provide  valuable  application 
support    Vendors  that  market  demographic  data, 
map  files  and  mapping  software  offer  one-stop 
shopping  that  can  reduce  the  time  spent  with 
several  different  vendors.    Software  maintenance 
programs  provide  enhanced  versions  of  a 
product,  thereby  maintaining  capabilities  close  to 
state  of  the  art    And  it  is  most  important  that 
vendors  should  provide  training,  installation, 
testing,  tutonals,  system  documentation,  and 
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California  law  specifically  states  that  persons 
listed  for  service  in  the  court: 

shall  be  fairly  representative  of 
the  population  in  the  area 
served  by  the  cotirt  and  shall  be 
selected  upon  a  random  basis 
(Section  9.  203). 


Introduction 

Geographic  groupings  which  overlap  with  racial 
and  economic  groupings  constitute  recognizable 
classes.^  Rural  residents,  for  example,  might  be 
underrepresented  due  to  excuses  based  on 
distance  to  the  courthouse;  often  selection 
officials  acquiesce  in  the  reluctance  of  rural 
residents  to  serve.    Excuses  based  on  willingness 
to  travel  great  distances  have  so  reduced  the 
jur}'  pool  that  remedial  action  is  required  even 
without  proof  of  geographic  cohesiveness.^ 

The  notion  of  vicinage  or  geographical  locality 
requirement  of  jury  selection  has  been  traced  at 
least  as  far  back  as  to  Charlemagne  (Charles 
the  Great)  in  768  A.D.    He  instituted  several 
reforms,  one  of  which  was  the  establishment  of 
"Inquisito".    One  of  the  requirements  of  the 
Inquisito  was  that  13  to  66  witnesses  be  chosen 
from  the  neighborhood  where  they  woiild  have 
knowledge  of  the  matter  in  dispute  (Moore 


^See  Thiel  v.    Southern  Pacific  Co.,  328  U.S. 
217  (1946),  State  v.    Holstrom,  43  Wis.  465,  168 
N.W.  2d  574  (1969).  and  State  v.    Cage,  337  So. 
2d  1123  (La.  1976).    The  Federal  Act  requires 
that  selection  procedures  "ensure  that  eacn 
coimty',  parish  or  similar  political  subdivision 
within  tne  district  or  division  is  substantially 
proportionally  represented  in  the  master  jury 
wheel  for  that  judicial  district  division,  or 
combination  of  divisions"  (U.S.  1968,  Section 
1863  (b)  (3)). 

^  See  United  States  v.    Fernandez,  480  F.  2d 
726,  732-33  (2d  Cir.  1973). 
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1973).    The  requirement  of  a  jury  from  the 
vicinage,  found  in  the  Magna  Carta  as  well  as 
the  Sixth  Amendment,  is  also  based  on  the 
notion  that  jurors  should  be  selected  from  local 
residents.    The  meaning  of  this  requirement  is 
often  unclear,  as  when  a  case  from  one  division 
in  a  federal  district  court  is  tried  in  another 
division,  or  when  grand  jurors  are  selected  from 
only  one  division.' 

Ciurently,  federal  law  determines  the  nature  of 
prospective  jurors  by  specifying  two  key 
concepts  in  jury  venire  or  panel  selection 
procedures:  (1)  "a  random"  selection  of  jurors, 
and  (2)  the  inclusion  of  special  geographic 
districts  wherein  a  panicular  court  convenes,  i.e., 
vicinage  requirements  (U.S.  1968,  Section  1861). 
Ln  California,  as  in  most  states,  the  law  similarly 
requires:  (1)  a  random  selection  of  jurors  and 
(2)  selection  from  "judicial  districts  of  the 
respective  counties"  (CA.  1981,  Section  197, 
206).    Recent  Federal  and  California  Supreme 
Court  decisions  are  such  that  any  substantial 
violation  of  these  basic  requirements  of  jur>' 
selection  in  representativeness  is  a  prima  facie 
case  of  discrimination.'  Subsequently,  an 
increasing  number  of  challenges  concerning  the 
underrepresentation  of  "cognizable  groups,"  e.g., 
minorities,  have  been  brought  claiming  violation 
of  the  Sixth  Amendment,  a  representative  jur\' 
selected  from  a  fair  cross  section  of  the 
communiri'.' 


'  See  U.S.  1968,  Section  1861  and  House  Report 

at  1801.  ^ 

'  See  lis,  90th  Congress  Senate  Report  No. 
891  19"E7riJ^90th  Congress  House  Report  No. 
TU76  1968;^T¥eTaIe  Law  JoufnarT970:  KaifvT 
WJ2:  De  Cam  iW^,  Chevigny  1975;  Alker    ' 
Hosticka,  and  Michel!  1976;  Kairys,  Kadane, 
and  Lehoczky  1977;  Alker  and  Barnard  1978' 
Heyns  1979;  Butler  1980a,  1980b,  and  1981; 
Butler  and  Fukurai  1984;  Fukurai  and  Buder 
1985. 

'  For  example  in  California,  see  People  v 
White  43  Cal.  3d  740  1954;  People  v.    King  49 
Cal.    Rptr.  562  1966;  People  v.    Sirhan  7  C^. 
3d  258  1978;  People  v.    Wheeler  148  Cal. 
Rptr.  890  1978;  People  v.    Estrada  155  Cal. 


One  of  the  major  problems  in  jiuy  challenges  is 
to  establish  a  prima  facie  case  for  the 
imderrepresentation  of  minorities.    Part  of  the 
problem  is  due  to  the  ambiguous  relationship 
between  random  selection  and  vicinage 
requirement  (area  or  district).    Past  Supreme 
Coim  cases  have  dealt  with  the  systematic 
imderrepresnetation  of  cognizable  groups,  e.g., 
blacks  and  Hispanics;  however,  the  Coim  has 
not  addressed  the  extent  to  which  the  area 
served  by  the  court  relates  to  the  random 
selection  of  potential  jurors.    Vicinage  or 
geographic  representativeness  has  rather  been 
dealt  with,  along  with  the  random  selection  of 
jurors,  without  geographic  representativeness 
being  clearly  demarcated. 

In  Duren  v.    Missouri,  for  example,  the  U.S. 
Supreme  Court  held  that  a  three-prong  test 
must  be  applied  to  establish  a  prima  facie  case 
of  discrimination:  (1)  the  group  alleged  to  be 
excluded  is  a  'distinctive'  group  in  the 
community,  (2)  the  representation  of  this  group 
in  venires  from  which  jurors  are  selected  is  not 
fair  and  reasonable  in  relation  to  the  number  of 
such  persons  in  the  communit>',  and  (3)  this 
underrepresentation  is  due  to  systematic 
exclusion  of  the  group  in  the  jun'-selection 
process  (Duren  v.    Missouri  439  U.S.  357  364 
1978).    However,  the  Cotm  did  not  spell  out  a 
clear  cut  relationship  between  juror 
representativeness  and  the  vicinage  requirement, 
e.g.,  what  is  the  "commtmity"? 


"(cont'd)  Rptr.  731  1979;  People  v.    Graham 
160  Cal.    Rptr.  10  1979;  People  v.    Harris  36 
Cal.  3d  36.  201  Cal.    Fpn.  ih  679  R  2d  433 
1984.    In  Federal  Supreme  Court  see  Alexander 
v.    Louisiana  405  U.S.  625  1972;  Peters  v.    ICiff 
407  U.S.  493  1972;  Tavlor  v.    Louisiana  419 
U.S.  522  1975;  Duren  v.    Missouri  439  U.S.  357 
1979;  City  of  Mobile,  Ala  v.    Bolden  466  U.S. 
55  1980. 
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Vicinage  Requirement 

The  vicinage  or  geographical  requirement  of 
jur>'  trials  is  an  essential  element  of  the  Sixth 
Amendment  as  it  pertains  to  the  jury  selection 
process.    For  illustrative  purposes,  we  will  use 
Los  Angeles  Count}'  and  its  twent>'  mile  radius 
rule.    However,  the  process  itself  is 
generalizable  to  all  areas  of  Los  Angeles  Count)' 
and  any  other  bounded  area  such  as  a  county, 
judicial  district,  etc. 

The  first  provision  for  a  jury  trial  in  a  vicinage 
can  be  found  in  Article  EI  of  the  Constitution. 
Article  III.    Section  2  notes: 

The  Trial  of  all  Crimes,  except  in  Cases 
of  Impeachment,  shall  be  by  Jury;  and 
such  Trial  shall  be  held  in  the  State 
where  the  said  Crimes  shall  have  been 
committed;  but  when  not  committed 
within  any  State,  the  Trial  shall  be  at 
such  Place  or  Places  as  the  Congress  ma\- 
by  Law  have  directed. 

Early  in  the  1970s,  the  Los  Angeles  Count}' 
Board  of  Supervisors  developed  a  policy  that  no 
juror  had  to  travel  more  than  twent}'  miles  from 
his/her  house  to  the  courthouse.    The  Count}' 
Board  adopted  this  rule  because  of  convenience 
for  prospective  jurors  and  economic  reasons  for 
the  county.    Subsequently,  in  California,  the 
legislature  defined  the  judicial  district  in  Los 
Angeles  Count}-  as  being  within  a  twent}'  mile 
radius  from  each  courthouse.    The  California 
Code  of  Civil  Procedure  states  that: 

Each  court  shall  adopt  rules 
supplementar}'  to  such  rules  as  may  be 
adopted  by  the  Judicial  Council, 
governing  the  selection  of  persons  to  be 
listed  as  available  for  service  as  trial 
jurors.    The  persons  so  listed  shall  be 
fairiy  representative  of  the  population  in 
the  area  served  bv  the  court,  and  shall  be 


selected  upon  a  random  basis.    Such  rules 
shall  govern  the  duties  of  the  court  and 
its  attaches  in  the  production  and  use  of 
the  juror  lists.    In  counties  with  more 
than  one  coun  location,  the  rules  shall 
reasonably  minimize  the  the  distance 
traveled  by  jurors.    In  addition,  in  the 
Count}'  of  Los  Angeles  no  juror  shall  be 
required  to  serve  at  a  distance  greater 
than  20  miles  from  his  or  her  residence 
(CA.  1981,  Section  7.  203). 

Despite  the  explicit  rule  of  random  selection  of 
potential  jurors  from  the  judicial  district  defined 
within  20  mile  radius,  recent  jur}'  venire 
challenge  cases  have  argued  the  following  two 
points:  (1)  there  is  a  significant 
underrepresentation  of  prospective  minority 
jurors  and  (2)  there  is  an  overrepresentation  of 
particular  neighborhoods  with  high 
concentrations  of  anglos  (Hevns  1979;  Butler 
1980a,  1980b,  1981;  Butler  and  Fukurai  1984; 
Huebner-Dimitrius  1984;  Fukurai  and  Butler 
1985;  Fukurai  and  Butler  1986).    These  studies 
have  shown  that  census  tracts  with  a  high  anglo 
concentration  are  consistently  overrepresented 
and  consequently  jur}'  venires  have  consisted  of 
a  large  number  of  potential  anglo  jtirors,  and  an 
underrepresentation  of  minorities.    This  is 
apparent  for  all  Superior  Court  districts  in  Los 
Angeles  Coimt}'  except  the  Central  District 
(Heyns  1979). 

It  is  theoretically  possible  to  have  race/ethnic 
representation  on  juries,  yet  not  have  a  fair 
cross  section  of  the  communit}'  or  areas  served 
by  the  court  from  which  jurors  are  being  drawn 
to  serve  on  juries  (Heyns  1979; 
Huebner-Dimitrius  1984;  Fukurai  1985;  Fukirrai 
and  Butler  1985).    Generally  however,  racial  and 
geographic  representativeness  are  highh" 
correlated;  therefore,  it  is  possible  to  ensure  the 
cross-section  representation  of  minorities  by 
controlling  the  random  selection  of  geographic 
areas.    Currently,  the  ovenepresentation  of 
particular  neighborhoods  contributes  to  a 
substantially  greater  chance  of  anglos  serving  on 
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juries,  while  a  random  selection  of 
neighborhoods  would  ensure  the  fair 
representation  of  minorities  as  prospective  jurors 
within  a  districL 

In  this  paper,  we  present  an  analytic  strategy 
that  will  overcome  racially  disproportionate  jury 
venires.    Rather  than  first  focusing  on  the 
selection  of  individual  potential  jurors,  random 
selection  of  neighborhoods  is  examined,  i.e., 
census  tracts  from  which  prospective  jurors  are 
being  drawn  to  serve  on  juries.    Our  analysis 
demonstrates  the  extent  to  which  neighborhood 
representativeness  could  rectify  the 
disproportionate  underrepresentation  of 
minorities  currentiy  the  case  in  most  jur}' 
venires.    The  main  thrust  of  this  paper,  thus,  is 
threefold:  (1)  to  propose  a  geographc  sampling 
strategy  to  overcome  imderrepresentativeness  of 
minorities,  (2)  to  illustrate  our  strategy  using 
simulation  techniques,  and  (3)  to  show  the 
extent  to  which  geographical  randomness  can 
help  ensure  that  racially  proportionate  jury 
venires  are  obtained.    By  simulating  the  Los 
Angeles  Cotmt\'  selection  process,  a  comparison 
between  the  actual  jury  composition  and  the 
simiilated  jur,-  composition  is  examined  to  show 
the  extent  to  which  the  proposed  geographic 
samphng  strategy  is  superior  to  the  current 
selection  procedures  employed  in  Los  Angeles 
Cotmty  and  elsewhere. 


Data 

Two  data  sets  were  linked  to  serve  as  the 
foundation  for  the  simulation  of  the  jury 
selection  process:  (1)  1980  U.S.    Census  Bureau 
data  and  (2)  jury  impanelment  hsts  for  a  retrial 
of  the  Hams  case  (36  Cal.  3d  36  201  Cal. 
Rptr.  782  679  P.  2d  433  1984).' 


Eight  jiu7  impanelment  lists  were  obtained  to 
delineate  neighborhoods  (census  tracts)  from 
which  jurors  were  being  drawn  to  the  Long 
Beach  Superior  Court  and  to  determine  whether 
or  not  the  panels  represented  a  fair  cross 
section.    The  impanelment  period  imder 
investigation,  while  not  ideal,  was  lengthy 
enough  to  determine  whether  or  not  jury 
venires  were  representative  of  the  commimity 
population.    These  eight  panels  were  typical  of 
panel  data  available  for  other  time  periods, 
including  the  first  Harris  trial  1979. 


Empirical  Analysis 

Figure  1  depicts  racial  composition  of  the  Long 
Beach  judicial  district  using  a  variety  of 
definitions  of  the  area  served  by  the  court 
Map  A  illustrates  the  areas  served  by  the 
Superior  Court  as  presented  in  the  Harris 
retrial.    (Ed.Note.    Figures  and  maps  have 
collected  together  at  end  of  article) 

This  variation  in  the  definitions  of  the  area 
served  by  the  court  shows  that  there  is  a 
potential  for  either  conscious  or  imconscious 
manipulation  of  minority  representation  on  jury 
panels.    Thus  the  particular  "area  served  by  the 
court"  becomes  important  in  jury  challenges. 
That  is,  if  it  is  to  be  determined  whether  or  not 
jurors  represent  a  fair  cross  section  of  the 
community,  the  area  served  by  the  court  must 
be  clearly  delineated  or  a  valid  comparison 
cannot  be  made. 

Six  different  areas  served  by  the  court  emerged 
during  the  Harris  retrial.    The  first  was  Los 
Angeles  Coimry  as  a  whole.    In  the  first  Hams 


Empirical  analyses  of  People  v.    Harris  (36 
Cal.  5d  36,  201  Cal.    Rptr.  782  679  ?.  2d  433 
1984)  were  performed  at  University  of 


'(cont'd)  California.  Riverside.    In  People  v. 
Harris,  the  motion  of  respondent  for  leave  to 
proceed  in  forma  pauperis  was  granted; 
however,  the  Writ  of  Certiorari  by  the 
prosecution  to  the  Federal  Suprerne  Court  was 
denied  on  Oct  29,  1984. 
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trial  tlie  prosecution  argued  that  Los  Angeles 
County-wide  data  were  the  proper  comparison. 
The  Harris  opinion  rendered  by  the  California 
Supreme  Court  concluded  that 

The  parties,  however,  presented  evidence 
and  argued  this  case  on  the  assumption 
that  all  jtmes  in  Los  Angeles  Coimty 
must  be  representative  of  the  entire 
county.    The  principal  question  before  us 
is  whether  evidence  based  on  total 
countywide  population  figures,  rather  than 
jtiry-eligible  population,  is  adequate  to 
make  out  a  prima  facie  case;  for  the 
reasons  explained  in  this  pinion,  we 
conclude  that  it  is.    The  state  has  not 
attempted  to  rebut  this  prima  facie 
showing  by  arguing  that  the  Long  Beach 
juries  need  only  represent  those  persons 
living  within  20  miles  of  the  courthouse, 
and  has  not  attempted  to  show  that  such 
juries  were  truly  representative  of  that 
limited  area  (Harris  36  Cal.  3d  36.  201 
Cal.    Rptr.  782  679  P.  2d  433  1984). 

A  second  area  served  by  the  court  is  within  a 
20-mile  region,  as  delineated  by  California  state 
law;  that  is,  any  juror  may  be  excused  from 
being  sent  to  a  particular  courthouse  that  is 
further  than  20  miles  from  his/her  residence. 
In  1978,  the  20-mile  region  delineated  by  the 
Jur>-  Services  Division  in  Los  Angeles  County 
was  for  the  most  part  a  20-mile  straight  line 
from  the  courthouse.    However,  in  1983  the 
area  served  by  the  court  was  reduced  to  a 
15-mile  direct  line,  presumably  on  the  basis  that 
the  driving  distance  would  be  20  miles.    Thus,  a 
third  definition  of  the  area  serviced  by  the 
court  was  considered. 

A  fourth  area  served  by  the  court  was 
empirically  delineated.    This  area  was 
determined  by  delineating  those  census  tracts 
from  which  jurors  were  summoned  for  eight 
panels.    A  fifth  area  sen-ed  by  the  court  could 
not  be  determined  geographically  but  is 
obviouslv  different  from  the  others.    This  fifth 


area  is  a  subset  of  the  fourth  area  which  was 
geographically  determined.    For  each  juror 
summoned,  knowledge  of  their  census  tracts  and 
address  was  made  available,  thus  the  area 
served  by  the  court  could  be  empirically 
determined.    All  of  the  potential  jurors  who 
showed  up  at  the  Long  Beach  Cotirthottse  came 
from  these  impanelments  and  thus  were  a 
subset  of  the  impanelment  lists.    However, 
between  the  impanelment  or  summons  stage  and 
the  jun-  venire  stage,  there  was  between  a 
40-50  percent  dropout    Thus,  they  are  similar 
but  not  the  same.    Unfortunately  we  were 
unable  to  delineate  areas  of  residence  at  this 
stage. 

Finally,  during  the  course  of  the  Harris  retrial, 
the  prosecution  argued  that  a  sixth  area  was 
more  important  than  these  other  five.    This 
specific  area  was  known  as  the  Long  Beach 
Superior  Court  District,  as  defined  by  the  Los 
Angeles  Board  of  Supervisors.    This  area  is  used 
by  the  legal  system  in  allocating  trials.    Thus,  if 
a  person  commits  a  crime  in  this  bounded  area 
and  it  becomes  necessary  to  have  a  trial,  it 
typically,  but  not  invariably,  will  be  assigned  to 
tjie  Long  Beach  Courthouse.    However,  this 
particiilar  area  is  not  coterminous  with  any  of 
the  other  five  areas  served  by  the  court 
Obviously,  all  the  areas  are  within  Los  Angeles 
Count)',  but  otherwise  they  have  nothing  in 
common. 

Table  1  shows  the  racial  composition  of  the 
eligible  Hispanic  population  and  impanelment 
hsts  using  the  15-mile  radius  definition  of  the 
area  served  by  the  court    Thus,  while  20.9 
percent  of  potential  jurors  at  a  15-mile  level 
were  Hispanic,  only  10.1  percent  of  jurors 
impaneled  and  summoned  to  the  Long  Beach 
Superior  Court  were  Hispanic. 
Underrepresentation  of  Hispanic  jurors  was 
inevitable  because  of  the  under-selection  of 
Hispanically  dominant  census  tracts.    Table  1 
thus  shows  that  more  than  one-half  of  the 
potential  Hispanic  jurors  were  underrepresented 
on  the  impanelment  list    Z  scores  and 
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chi-square  values  show  that  Hispanic  jurors  are 
statistically  underrepresented  on  the 
impanelment  list;  thus,  the  Hispanic  composition 
on  the  impanelment  lists  is  significantly  different 
from  the  racial  composition  of  the  Long  Beach 
judicial  district,  as  defined  by  the  15-mile 
radius. 

The  underrepresentation  of  both  Hispanic  and 
black  jurors  on  the  eight  panels  under 
investigation  was  consistent    Table  2  shows  the 
racial  composition  of  minority  jurors  on  both 
impanelment  lists  and  census  tracts  from  which 
potential  jurors  are  simimoned.    Census  tracts 
with  a  high  concentration  of  anglos  are 
overrepresented  whereas  minority  dominated 
census  tracts  are  underrepresented. 

Table  3  shows  the  representation  of  census 
tracts  on  eight  impanelments.    Potential  jurors 
from  one  census  tract  were  represented 
thirty-six  times,  while  fiftv-one  census  tracts 
were  represented  less  than  five  limes.    Table  3 
also  indicates  that  one-half  the  potential  jitrors 
came  from  twenty-three  census  tracts  (5.2%)  of 
the  total  of  439  tracts  in  the  Long  Beach 
judicial  district  defined  by  a  15-niile  radius. 


a  high  concentration  of  anglos.    In  any  case, 
and  for  whatever  reason,  anglo  dominant  census 
tracts  are  clearly  ovenepresented  on  the  jury 
impanelment  lists. 

One  dubious  explanation  is  that  anglos  are  more 
qualified  than  minority  groups  for  jury  duty.* 
However,  the  proportion  of  qualified  juiors  is 
the  same  for  the  impaneled  census  tracts  and 
the  Long  Beach  Superior  Court  judicial  district 
as  a  whole.    Table  5  shows  the  proportion  of 
qualified  jurors  in  the  impanelment  list  and  the 
Long  Beach  judicial  district    While  19.9  percent 
of  jurors  in  impaneled  census  tracts  are 
qualified  jurors,  19.1  percent  of  those  in  the 
Long  Beach  judicial  district  are  equally 
qualified.    Thus,  the  percentage  of  qualified 
jurors  has  no  bearing  on  the  underrepresentation 
of  minority  jurors.    Further,  a  random  method 
of  selecting  jurors  has  not  been  exercised,  i.e., 
impaneled  census  tracts  are  clustered  in 
particular  regions  characterized  by  an  anglo 
population.    Many  minority  dominant  census 
tracts  are  not  included  in  the  impanelment  list, 
even  though  the  proportion  of  qualified  jurors  is 
the  same  in  both  impaneled  and  non-impaneled 
census  tracts. 


Table  4  indicates  the  average  representation  of 
census  tracts  on  eight  panels.    Tlie  table  shows 
the  extent  to  which  tract  representation  is 
related  to  the  racial  composition  of  the  census 
tract    For  example,  census  tracts  which  were 
selected  less  than  the  average  number  of  times 
had  three  and  ten  percent  higher  black  and 
Hispanic  populations  respectively.    One-half  the 
overrepresented  census  tracts  had  four  and 
fourteen  percent  less  black  and  Hispanic 
population,  respectively. 

Maps  1  to  8  illustrate  the  census  tracts  from 
which  actual  potential  jurors  were  summoned. 
These  maps  show  that  the  census  tracts  were 
concentrated  in  particular  regions,  i.e.,  the  lower 
portions  which  border  on  Orange  County. 
From  previous  tables,  it  should  be  obvious  by 
now  that  these  census  tracts  are  characterized  by 


'  Research  indicates  the  ovenepresentation  of 
anglo  jurors  is  necessary  since  criminality  is 
inherent  in  some  minorit^■  groups  (Hepburn 
1978;  Cullen  and  Link  1980;  Turk  19§1; 
Kramer  1982).    Thus,  minority  groups  "take[s]  a 
permissive  view  of  crime  within  its  Dorder.    As 
a  result,  the  black  community  is  vulnerable  to 
its  own  criminal  element  as  well  as  to  the 
criminal  element  of  the  white  communitv"  (The 
Yale  Law  Journal  1970,  p.534).    Further^ 
researchers  suggest  that  count\'  clerks  responsible 
for  selecting  names  from  master  files  purposely 
exercise  systematic  selection  rather  than  random 
selection  in  creating  raciallv  disproportionate 
jury  pools  (Alker  and  Barnard  1978;  Levine  and 
Schweber-Koven  1976).    Because  so  many 
different  persons  use  individual  discretion  to 
decide  who  should  be  excused  and  who  should 
serve,  the  possibility  of  individual  prejudice 
influencing  excuses  and  exemptions  is  great 
(Van  Dyke  1977,  p.391). 
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Geographic  Random  Selection 

One  means  by  which  to  rectify  the 
disproportionate  representation  of  census  tracts  is 
to  implement  the  random  selection  of  census 
tracts  within  a  judicial  district,  but 
geographically  defined.    Such  random  selection 
should  provide  a  Hst  of  census  tracts  equally 
distributed  within  the  limited,  spatially  bounded 
context,  i.e.,  15-mile  radius,  or  whatever.    Since 
our  analysis  shows  that  qualification  of 
particular  racial  populations  does  not  have  a 
bearing  on  the  selection  of  anglo-dominant 
census  tracts,  the  random  selection  of  tracts 
provides  a  foundation  for  equally  selecting 
various  racial/ethnic  groups  within  them  thus 
resulting  in  a  fair  cross-section  of  the 
population,  vis-a-vis  minorities. 

A  simulated  random  selection  of  census  tracts 
was  carried  out  in  the  following  manner.    Each 
census  tract  within  a  20-niile  radius  of  Long 
Beach  judicial  district  was  given  a  unique 
number.    A  series  of  random  numbers  were 
generated  for  the  selected  number  of  census 
tracts  for  each  of  eight  panels.    Those  eight 
individual  simulations  were  conducted  to 
conespond  to  the  actual  eight  impanelments  as 
previously  empirically  analyzed  and  used  in  the 
Harris  retrial. 

Maps  9  through  16  illustrate  the  simulated 
mapping  of  census  tracts  randomly  selected 
within  the  Long  Beach  judicial  district    Each 
map  shows  that  selected  tracts  are  evenly 
distributed  in  space.    The  nimiber  of  potential 
jtirors  also  shows  that  using  this  process, 
minority  groups  would  have  an  eqtial  chance  of 
selection  for  jiiry  service. 

Table  6  shows  the  racial  composition  of  selected 
census  tracts  for  each  of  the  simulated  panels. 
Within  the  2(>-mile  radius  26.7  percent  of 
potential  jtirors  were  Hispanic  and  14.8  percent 
for  black.    A  Z  scores  statistical  test  for 


difi'erences  in  racial  composition  between  census 
tracts  derived  by  random  selection  and  the 
20-mile  radius  district  was  then  carried  ouL 
Not  one  of  the  scores  was  significant,  suggesting 
that  each  of  the  randomly  selected  samples  of 
tracts  had  a  racial  composition  similar  to  that  of 
the  20-mile  radius  judicial  district    This,  of 
course,  is  in  stark  contrast  to  the  actual 
impanelments  analyzed  in  the  first  section  of 
this  paper. 

Map  17  shows  the  mapping  of  all  census  tracts 
in  eight  panels  using  the  simtilation  method. 
The  map  shows  that  random  selection  of  census 
tracts  provides  a  virttial  equally  distributed  hst 
of  tracts  from  which  potential  jurors  would  have 
been  summoned.    Such  random  selection  also 
provides  an  imbiased  racial  representation. 


Critique 

The  results  of  our  simulation  clearly  show  that 
the  process  we  have  suggested  is  far  superior  to 
the  current  process  in  ensuring  a  fair  cross 
section  of  jurors. 

One  question,  of  course,  is  whether  or  not  this 
process  is  allowable  under  current  Federal  and 
State  stames.    Our  response  is  that  not  only  is 
it  allowable,  but  the  results  of  the  simulation 
imply  that  our  process  shotild  be  mandated  by 
law.    Another  argument  that  possibly  could  be 
made  against  the  proposed  process  is  that 
qualification  varies  by  district    However,  our 
evaluation  of  the  actual  juror  qualification  rate 
for  Los  Angeles  County  compared  with  the 
Long  Beach  District  (20-mile)  showed  that  the 
qtialification  rate  was  virttially  identical.    Even  if 
there  had  been  some  variation,  such  variation 
could  be  fined  into  the  system. 

.Another  possible  objection  to  the  randomized 
geographical  process  is  that  it  would  increase 
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the  overall  mileage  driven  by  jurors.    This  is 
true.    Any  system  that  results  in  a  fair  cross 
section  will  result  in  more  aggregate  miles 
driven  because  of  the  very  fact  that  the  jurors 
would  be  from  all  areas  of  the  district  rather 
than  concentrated  in  certain  areas.    This  is  a 
necessary'  part  of  a  system  that  results  in  a  fair 
cross  section  of  the  community  —  jurors  must 
come  from  all  parts  of  the  community.    The 
proposed  system  does  away  with  the  idea  of 
selecting  only  jurors  from  areas  closest  to  the 
court,  and  in  fact,  requires  just  the  opposite. 
That  is,  jtirors  are  drawn  from  all  areas  of  the 
district    However,  the  district  could  still  fall 
within  the  state  law  mandated  20-mile  region 
for  Los  Angeles  County. 

In  Los  Angeles  Coimty,  a  particular  problem 
that  must  also  be  dealt  with  is  the  overiapping 
of  judicial  district  boundaries.    This  problem  is 
amenable  to  statistical  sampling  methods. 
However,  even  if  some  ovenepresentation 
should  occur,  it  would  be  substantially  less  than 
is  now  occurring  using  non-random  selection  of 
areas. 

Finally,  the  analysis  presented  here  represents 
only  pan  of  a  year  and  thus  might  be 
considered  static.    A  dynamic  jury  selection 
process  involves  selecting  jtirors  periodically. 
However,  jurors  also  are  qualified  only 
periodically.    Thus  a  dynamic  system  of  jury- 
qualification  could  use  the  same  technique 
described  in  the  simulation  section.    That  is,  the 
jury  qualification  process  could  also  be 
accomplished  by  the  random  selection  of  census 
tracts,  and  the  mailing  out  of  questionnaires 
periodically  throughout  the  year. 


Conclusions 

Our  analysis  conclusively  shows  that  currently 
there  is  a  systematic  and  biased  selection 
method  employed  in  the  Long  Beach  judicial 
district  and  elsewhere  in  Los  Angeles  County. 
The  racial  composition  of  actual  impaneled 
census  tracts  indicates  that  (1)  selected  census 
tracts  are  clustered  in  regions  with  high 
concentrations  of  anglos  and  (2)  Hispanic  and 
black  potential  jurors  are  systematically  weeded 
out  in  the  selection  process  because  of  biased 
impanelment  lists.    One  possible  reason  for  such 
systematic  selection  of  anglo  dominant  census 
tracts  might  be  that  anglo  jurors  in  particular 
census  nacts  are  more  qualified  than  their 
minority  counterparts.    However,  the  proportion 
of  qualified  jurors  from  anglo  dominant  census 
tracts  was  the  same  as  that  of  the  judicial 
district  as  a  whole. 

We  suggested  an  alternative  sampling  strategy  of 
random  selection  of  census  tracts  which  provides 
a  representative  list  of  tracts  from  which 
potential  jurors  could  be  summoned.    Our 
simulation  analysis  showed  that  selected  census 
tracts  could  provide  a  list  of  potential  jurors 
that  would  be  unbiased,  i.e.,  racially 
representative.    That  is,  the  impanelment  lists 
would  have  a  racial  composition  similar  to  the 
judicial  district    Our  method  of  randomly 
selecting  census  tracts  is  clearly  superior  to  the 
selection  method  currently  employed  in  Los 
Angeles  County,  because  the  potential  jurors 
coming  from  the  selected  tracts  are  evenly 
distributed  and  have  an  equal  chance  of  being 
selected.    The  random  selection  of  census  tracts, 
thus,  is  congruent  with  requirements  established 
by  both  the  Federal  Jury  Selection  and  Service 
Act  in  1968  and  the  California  Code  of  Civil 
Procedure  in  1981.' 


'  Federal  Jury  Selection  and  Service  ka  was 
passed  in  1968  to  guarantee  that  "all  litigants  in 
Federal  courts  entitled  to  trial  by  jury  shall 
have  the  right  to  grand  and  petit  juries  selected 
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Cullen,  Francis  T.  &  Bruce  G.    Link.  1980. 

Crime  as  an  occupation.    Criminology 
18:399-410. 
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Figure   1 
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Table   1 
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Table  3 
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Table  4 
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Map   1 


LONG  BEACH:  APRIL  24,  1985 
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Map   2 


LONG  BEACH:  MAY  1,  1985 
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Map  3 


LONG  BEACH:  MAY  8.  1985 
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Map  4 


LONG  BEACH:  MAY  15.   1985 
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Map  5 


LONG  BEACH:  MAY  22,   1985 
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Map  7 


LONG  BEACH:  JUNE  5.   1985 
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Map  8 


LONG  BEACH:  JUNE   12,   1985 
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Table  5 


LONG  3s:ach  Superior  col'rx 


Long    Beach  Judicial   Dlscrict  lapaneljieai:    Llscs~ 

No.                           Percent  No.                Percenc 

local  Jurocs                         243,274                       lOOZ  62,753               lOUJ; 

Qualified   Jurors                    46,436                           19.11  12,459                  19. 9Z 

1.  Total   number   of   census    tracts   are  439. 

2.  Total   number  of   census    tracts   are   107. 


*   Source:    Los   Angeles   Jury   Supervisor    Ray   Arce   and   his    computer 
consultants,    1985 
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LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   11 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   12 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   13 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   14 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map  15 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   16 


LONG  BEACH  SUPERIOR  COURT  DISTRICT 
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Map   17 


LONG  BEACH  DISTRICT:  8  PANELS 
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CD-ROM  and  the 
data  archive: 

beyond    retrieval 


Optical  storage  offers  data  archives  a  new 
medium  for  storing  data  and  a  change  in  the 
information  retrieval  environmenL    As  with  any 
new  technological  development,  it  is  important 
to  evaltiate  the  impact  and  tradeoffs  of  adapting 
to  new  eqtiipment  and  systems.    By  stepping 
back  for  an  overview  of  information  storage  and 
access,  better  evaluations  may  be  made  within 
the  context  of  the  "big  picttire"  of  data  use. 
Before  embracing  this  new  technology,  it  is 
important  to  understand  where  we  are  going 
and  how  optical  storage  can  help  us.    Optical 
media  offer  some  solutions  to  storage  problems, 
but  have  not  yet  shown  themselves  to  be 
solutions  to  the  overall  needs  of  information 
providers  and  users. 


by  Aim  Gerken' 

Universirv  of  California,  Berkeley 


'Presented  to  The  Association  of  Public  Data 
Users  October  22,  1987,  Washington,  DC 


First  of  all,  thanks  go  to  Forrest  Williams, 
Census  Bureau,  and  to  George  Hall  and 
Courtney  Slater,  of  Slater-Hall  Information 
Products,  and  to  Ed  Spar,  of  Market  Statistics. 
These  people  have  provided  us  with  CDROM 
lest  disks  for  our  evaluation,  and  I  sincerely 
appreciate  their  support    Some  of  the  points  I 
will  discuss  today  have  been  included  in  the 
report  of  the  State  Data  Center's  Subcommittee 
on  CDROM.    This  repon  is  available  from 
John  Kavalaimis  of  the  Data  User  Services 
Division.    My  ideas  do  not  necessarily  reflect  all 
the  opinions  of  that  subcommittee. 

Consider  the  advantages  of  optical  media  as 
storage  devices.    CDROM  and  WORM  disks 
provide  a  minimimi  of  10  years'  secure  storage. 
The  disks  provide  random  access,  and  are 
portable  and  compact    However,  there  are  some 
problems.    For  most  data  archives  who  rely 
upon  large  centralized  computing  systems,  the 
shift  from  storing  data  on  tapes  and  disk  packs 
to  storing  data  on  microcomputer  peripherals 
means  a  dramatic  shift  in  responsibilm'.    How 
will  data  archives  provide  mulnple  user  access 
to  data  on  microcomputers'^    Can  archives 
handle  the  equipment  and  network  burden? 
Another  problem  with  CDROM  is  that  archives 
cannot  produce  CDROM  disk  copies  in  house; 
there  is  no  local  recording  capability.    This 
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means  thai  if  data  archives  are  to  produce 
optical  backups  of  their  holdings,  they  must  use 
WORM  disks,  and  will  have  to  acquire  and 
maintain  two  kinds  of  optical  devices.    Other 
problems  are  access  speed,  and  in  some  cases, 
limited  storage  capability  (CDROMS  only  hold 
4  reels  of  tape,  and  STF4  for  California  takes 
up  14  reels  of  tape).    Another  issue  to  consider 
is  that  the  software  to  access  the  data  on 
CDROMs  and  WORMs  has  not  kept  pace  with 
the  technology.    Even  though  information  on 
microcomputers  appears  to  be  closer  to  the  end 
user,  the  interfaces  and  support  for  access  are 
often  limited.    And  finally,  we  must  ask  when 
will  opdcal  storage  be  cost  effective,  and  when 
will  the  major  data  distributors  begin  to  use  this 
technology? 

Information  retrieval  is  the  process  of  extracting 
specific  data  items  from  specific  records  in  a 
data  file.    At  this  time,  information  retrieval 
software  is  produced  for  each  CDROM  product 
since  standard  retrieval  software  for  statistical 
data  files  is  not  yet  available.    At  its  best,  this 
retrieval  process  is  informed,  flexible,  logical, 
and  fast    However,  since  each  data  file  comes 
with  its  own  custom  software  there  is  a  wide 
range  in  the  quality  of  these  software  interfaces. 
Users  will  be  required  to  learn  multiple  retrieval 
processes.    Also,  stadsticaJ  CDROM  products  are 
more  expensive  to  produce  since  producing 
custom  software  for  each  data  file  requires 
extensive  programming. 

The  premastering  and  production  of  the  optical 
disks  themselves  are  different  from  producing 
magnetic  tape.    The  physical  process  itself  is 
more  expensive,  and  the  nature  of  the  files  on 
the  disk  differ.    Since  the  mediimi  is  randomly 
accessed,  "disk  geography."  or  where  on  the 
disk  various  records  will  be  placed,  is  more 
important  than  with  tapes.    Disk  geography  can 
improve  performance  if  done  carefully.    Disk 
indexes  should  be  produced  and  mastered  with 
the  data  file  if  the  disk  is  to  prove  efficient  in 
sophisticated  retrieval  applications.    Error 
detection  and  correction  codes  must  also  be 


provided.    In  summary,  "raw"  data  files  on 
CDROM  should  be  more  carefully  developed 
than  those  provided  on  magnetic  tape.    To 
exploit  the  technology  and  access  software, 
optical  products  should  be  distributed  with  value 
added  files. 

After  information  is  retrieved,  "beyond 
retrieval",  lies  the  worid  of  post  processing 
where  the  information  is  manipulated  and 
packaged  to  user  specifications.    At  its  best,  this 
processing  should  be  supported  by  an  integrated 
system  providing  clear  steps  for  users  to  follow. 
Information  retrieved  from  CDROM  products 
should  be  compatible  with  inhouse  software,  and 
transfer  of  the  information  should  be  standard 
and  simple. 

In  oiu'  evaluations,  we  considered  three 
CDROM  products  in  relation  to  data  product 
preparation,  information  retrieval,  and  post 
processing.    The  three  CDROM  products  we 
evaluated  were:  Census  Test  Disk  #1,  which 
includes  data  and  software  for  ZIP  codes, 
population  estimates,  and  the  1982  Census  of 
Agriculture;  Slater  Hall's  1982  Census  of 
Agriculture  disk;  and  "Your  Marketing 
Consultant"  form  Market  Statistics.    The  Census 
disk  offers  simple  extract  programs  that 
demonstrate  the  usefulness  of  optical  storage. 
The  procedures  for  retrieval  are  straight  forward 
and  can  be  done  by  any  novice  computer  user. 
A  highlight  of  die  Slater  Hall  software  is  its 
online  help  for  each  variable  in  the  data  base, 
as  well  as  its  numeric  search  capabilities.    "Your 
Market  Consultant"  offers  the  capability  of 
sorting  data  by  user  defined  criteria,  and  allows 
users  to  add  their  own  data  to  the  tables. 

Our  evaluation  at  the  State  Data  Program 
emphasized  the  retrieval  and  post  processing 
capabilities  of  these  products,  but  did  not  take  a 
close  look  at  the  premastering  and  production  of 
the  CDROMs.    The  issues  of  disk  geography 
and  indexing  are  yet  to  be  evaluated.    However, 
in  the  area  of  retrieval,  it  is  clear  that  a 
standard  procedure  has  emerged  among  these 
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products.    Users  progress  through  a  series  of 
simple  procedures,  first  selecting  geographic 
areas,  then  identifying  a  table  or  variables,  then 
extracting  data  from  the  disk,  and  finally 
producing  display,  report  or  file  output 

What  more  do  archives  and  users  need  from 
this  kind  of  product?    Given  that  a  CDROM 
product  provides  the  information  that  a  user 
wants,  a  separate  procedure  is  often  required 
after  retrieving  information.    A  user  must  sort 
and  create  custom  tables,  graphics,  or  spread 
sheets  in  another  system.    Obviously,  there  is  a 
need  to  coordinate  software  compatibilit>-  and 
command  language  among  in-house  software,  so 
that  users  are  not  faced  with  a  confusing  array 
of  software  and  transfer  methods.    In  addition, 
the  lack  of  numeric  manipulation  and  statistical 
analysis  in  the  CDROM  software  requires  an 
interface  with  statistical  software  ctirrently  in 
use.    Data  as  well  as  textual  information  must 
be  moved  in  efficient  ways  and  in  compatible 
formats. 

None  of  the  current  CDROM  products  contain 
microdata,  i.e.  data  for  individual  households, 
persons  or  establishments.    The  data  that  are 
available  are  summaries  for  geographic  units, 
useful  for  many  applications,  but  are  only  part 
of  the  larger  universe  of  data.    Extracts  of 
microdata  files  are  regularly  produced  in  the 
archive  environment,  and  products  and 
procedures  using  CDROM  technology  should  be 
considered. 

Our  suggestions  are  that  1)  data  distributors 
keep  an  open  mind  about  which  kind  of  optical 
media  to  adopt,  so  that  archives  are  not  forced 
to  develop  dual  systems,  one  for  data  acquisition 
(CDROM)  and  one  for  data  file  backup 
(WORM).    We  also  are  concerned  about  the 
technology  becoming  outdated.  2)  Optical 
products  should  be  mastered  to  allow  for 
optimal  retrieval  by  a  variety  of  software, 
including  relational  database  management 
software.  3)  Retrieval  software  should  provide 
output  that  is  compatible  with  other  software. 


and  the  transfer  of  data  should  be  an  integral 
part  of  the  software  product  4)  Data 
distributors  should  view  their  products  not  only 
as  stand-alone  retrieval  systems,  but  as  pieces  of 
a  larger  information  support  system. 

Some  questions  for  the  future:  How  can  optical 
storage  be  integrated  into  the  existing 
information  fabric?    Who  will  build  the 
interfaces  for  informed  transfer  of  data  from 
system  to  system?    Do  we  place  responsibility 
upon  the  private  sector  to  develop  retrieval 
products,  and  how  involved  cjm  we  become? 
What  information  systems  are  in  use  that 
identify,  retrieve,  transfer,  and  analyze  data? 
How  may  these  systems  influence  the 
development  of  CDROM  and  WORM  products? 

Data  archives  have  a  long  history  of  responding 
to  technological  changes:  from  IBM  cards  to 
low  density  tapes,  from  fioppy  diskettes  and 
random  access  disk  packs  to  optical  media.    Eric 
Tanenbaum  wrote:  "Archives  have  a  privileged 
role  among  information  providers  for  they  were 
among  the  first  to  computerize.    Thus  they  offer 
a  rare  perspective  from  which  to  view  the 
changes."    (IASSIST  Ouarteriv.  Spring  1986)  I 
believe  that  archivists  will  be  studying  these 
changes  and  will  offer  a  unique  perspective  in 
emphasizing  the  integration  of  new  technology 
into  existing  services  and  procedures.    Archivists 
and  other  information  professionals  have  an 
opporttmity  to  participate  in  the  development  of 
standard,  reliable  storage  media,  to  assist  in  the 
design  and  evaluation  of  powerful  fiexible 
retrieval  software,  and  to  help  create  dynamic 
post-retrieval  environments.n 
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Contents   of  Current  Journals 


Contents  of  current  journals 


European  political  data  newsletter 
nr.  64  September  1987 

Data  Section: 
Research  in  progress: 

p.    4  The  Luxembourg  income  study. 

p.  17  The  development  of  trade  unions  in  western  European  societies. 

p.  21  Study  of  the  Spanish  trade  union  movement  during  the  post-Franco  transition./  Salvador 
Aguilar 

Books: 

p.  23  Ten  years  later:  contributions  to  the  analysis  of  the  party  system  in  Spain./  Gabriel  Colome 

p.  25  Election  more  about  who  governs  than  what  government  does? 

Archive  news: 

p.  27  Membership  in  IFDO. 

Computer  section: 

p.  30  Political  science  information  exchange  via  available  telecommunications  facilities:  electronic  mail 

and  electronic  conferencing./  Per  Nielsen 
p.  35  Micro-CDS/ISIS:  a  bibliographic  information  management  software  from  UNESCO./  Peter 

Jasco  et  al. 

Books: 

p.  52  Danish  elections  1920-1979.    A  logit  approach  to  ecological  analysis  and  inference./  .lostein 

Ryssevik 
p.  56  4CaST/2:  a  time  series  forecasting  and  data  analysis  system, 
p.  58  SHAZAM:  a  general  computer  program  for  econometric  methods  (version  5). 

Forthcoming  events: 

p.  60  APDU:  12th  annual  conference. 
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p.  61  IASSIST:  14th  annual  conference. 

p.  62  SSDBM:  4th  international  working  conference. 

p.  63  ICDBHSS:  3rd  annual  conference. 


Bits  &  bytes  review 
vol.  1(5),  April  1987 

p.     1  Product  reviews:  spelling  programs.    Four  MS-DOS,  stand  alone  spelling  programs: 

Jet:Spell(tm),  MicroSpell,  Word  Proof  II,  Webster's  NewWorld. 
p.  17  Coming  events, 
p.  18  What's  news, 
p.  20  Fine  print 
p.  20  In  future  issues. 


Bits  &  bytes  review 
vol.  1(6),  May  1987 

p.    1  Product  reviews:  five  stand-alone  spelling  programs  for  Macintoshes  -  Spelling  Champion, 

Spellswell,  Thunder,  Liberty  Spell  II,  WorksPlus  Spell, 
p.  16  Coming  events, 
p.  17  What's  news, 
p.  19  Fine  print 
p.  20  In  future  issues. 


ESRC  data  archive  bulletin 
nr.  38  September  1987 

News: 

p.  1  British  Library  research  and  development  award. 

p.  1  A  new  structure  for  the  ESRC. 

p.  1  Seminar  on  the  Online  Information  Retrieval  System. 

p.  2  Dissemination  of  research  results. 

p.  2  GHS  update. 

p.  3  Rural  areas  database  update. 

p.  3  New  offices. 

p.  3  New  staff. 
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New  acquisitions: 

p.    4  Selected  list  of  new  acquisitions. 


4  Health  evaluations. 

4  Social  identity'  and  the  understanding  of  economic  change. 

4  Women  and  leisure:  constraints  and  opportunities. 

5  Greenwich  open  space  project:  household  survey  and  group  transcripts,  1986. 

5  Public  lending  right  loans  data. 

6  Updates  to  serial  holdings. 


Research  organisations,  data  institutions  and  foreign  archives: 
p.    6  Northern  regional  research  laboratory  -  Lancaster. 

Software  bulletin- 

Notes: 

p.    7  The  Chorley  report  on  geographical  data  handling. 

p.    7  Why  Britain  needs  social  science  -  ALSISS 

p.    8  Survey  methods  newsletter. 

p.    8  Population  trends. 

p.    9  Social  research  association. 

p.    9  Information  management 

p.    9  Area. 

p.    9  Employment  gazette. 

p.    9  IASSIST  quarteriy. 

p.  10  European  political  data  newsletter. 

p.  10  BURISA 

p.  10  Government  computing. 

p.  10  SGCSA 

p.  10  Human  factors  in  computing  systems  and  graphics  interface. 

p.  10  ESRC  newsletter. 

p.  11  Statistical  news. 

p.  11  Assignation. 

p.  11  Research  on  criminal  justics  and  community  relations. 

p.  11  Essex  papers  in  politics  and  government 

p.  11  Course  for  social  scientists. 

p.  12  Forthcoming  events. 

Books: 

p.  15  Changing  the  definition  of  a  household,  by  Todd  and  Griffiths. 

Book  notes: 

p.  15  Nine-year-olds  grow  up,  by  S.    Mitchell 

p.  16  Personal  bibliographic  indexes  and  their  computerisation,  by  R.    Heeks 

p.  16  The  economics  of  online,  edited  by  P.    Bysouth 

p.  16  The  television  audience:  patterns  of  viewing  (an  update),  by  G.    Goodhardt  A.S.C.    Ehrenberg 

and  M.A.    Collins 
p.  16  Text  retrieval:  a  directory  of  software,  ed.  by  R.    Kimberly 
p.  16  The  future  of  the  metropolis:  Berin-London-Paris-New  York:  economic  aspects,  ed.  by  H.-J. 
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Ewers,  J.B.    Goodard  and  H.    Mazerath 
p.  16  The  quality  of  urban  life,  ed.  by  D.    Frick 
p.  16  Matrices  for  statistics,  by  M.J.R.    Healy 

p.  16  Information  theory:  structtrral  moels  for  qualitative  data,  by  K.    KlippendorfT 
p.  17  A  handbook  for  social  science  research,  by  B.R.    Dixon,  G.D.    Bouma  and  G.B.J.    Atkinson 

Appendices: 

p.  17  A:  Recent  additions  to  the  ICPSR  archive  at  Michigan. 

p.  18  B:  Datasets  acquired  since  the  publication  of  Bulletin  37. 


Scope:  humanities  computing  update 
vol.  5(4),  Jul-Aug  1987 

p.  37  Libraries  and  computer  centers:  a  merging  of  power  bases. 

p.  38  Software. 

p.  39  Courseware. 

p.  40  Publications. 

p.  41  Software  reviews. 

p.  42  Databases. 

p.  43  Hardware  reviews. 

p.  44  Call  for  papers.    Campus  News.    Networks. 

p.  45  Printing  with  non-standard  alphabets./  Ken  Br>'ant  Quotes. 

p.  46  Computers  for  learning:  an  overview  of  Project  Pegasus./  Diane  Balestri. 

p.  47  Calendar. 


Survey  methodologj 
vol.  l'3(l),  June  1987 

p.    1  Telephone  sample  designs  for  the  U.S.  black  household  population./  K.M.    Inglis,  R.M. 

Groves  and  S.G.    Heeringa 
p.  15  Comparing  telephone  and  face-to- face  interviewing  in  the  United  Kingdom./  W.M.    Sykes  and 

M.    Collins 
p.  29  Issues  in  the  use  of  administrative  records  for  statistical  purposes./  G.J.    Brackstone 
p.  45  Statistical  properties  of  crop  production  estimators./  C.A.    Francisco,  W.A.    Fuller  and  R. 

Fecso 
p.  63  Current  issues  on  seasonal  adjustment/  E.B.    Dagum 
p.  75  On  efficient  estimation  of  unemployment  rates  from  Labour  Force  Survey  data./  S.    Kumar 

and  A.C.    Singh  p.  85  A  sampling  procedure  with  inclusion  probabilities  proportional  to  size./ 

A.    Dey  and  S.K.    Srivastava 
p.  93  Sample  design  for  the  health  and  activity  limitation  survey./  D.    Dolson,  K.    McClean,  J.-P. 
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Morin,  and  A.    Theberge 
p.  109  Comparison  of  estimators  of  population  total  in  two-stage  sucessive  sampling  using  auxiliary^ 

information./  F.C.    Okafor 
p.l23  Corrigendum 


Data  user  news  from  the  Bureau  of  the  Census 
vol  22(9),  Sept.,  1987 

p.    1  New  catalog  and  guide  highlights  Census  Bureau  data  products  and  services.    Lower  payments 
for  child  support  p.    2  U.S.  and  Canada  sign  agreement  to  improve  export  data.  p.    3 
Telephone  contacts:  Bureau  of  the  Census,  p.    7  States'  share  of  exports  now  more  precisely 
measured.    Latest  count\'  population  estimates  on  CENDATA(tm).    Demand  for  CE>-ROM? 
p.     8  U.S.  statistics  at  a  glance.    New  SIC  codes. 


Data  user  news  from  the  Bureau  of  the  Census 
vol  22(11),  Nov.,  1987 

p.     1  New  report  focuses  on  minorit>'  business  owners. 

p.    2  Growth  in  number  living  alone  slackens. 

p.    3  Women  making  headway  in  male-dominated  professions. 

p.    4  Connecticut  had  highest  proportion  of  labor  force  in  export-related  jobs.    Census  booklet 

celebrates  constitution  bicenteimial. 
p.    5  TIGER,  map  information  now  online!    Exporters  get  new  electronic  reporting  assistance, 
p.    6  Population  exlimates  for  counties  and  metro  areas.    Measuring  English  language  proficiency, 
p.    7  Construction  boom  in  Midwest    Housing  starts  in  decline,  may  have  stabilized, 
p.    8  ASA  annual  meeting:  Census  Bureau  staff  discuss  research  on  statistical  trends  and  methods, 
p.  10  November  '86  voter  tiunout  down  from  '82.    Understanding  and  using  economic  census  data, 
p.  11  Federal  data  highlights, 
p.  12  U.S.  statistics  at  a  glance. 


Conununications  of  the  A.C.M. 
V.30(ll)  November  1987 

Articles: 

p.909  ARIADNE:  Pattern-directed  inference  and  hierarchical  abstraction  in  protein  structure 

recognition./  Richard  H.    Lathrop,  Teresa  A.    Webster  and  Temple  F.    Smith 
p.922  A  plausibility-driven  approach  to  computer  architecture  design./  Ulises  /\Jiuero  and  Subrata 
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Dasgupta 
p.933  Markup  systems  and  the  future  of  scholarly  text  processing./  James  H.    Coombs,  Allan  H. 
Renear  and  Steven  J.    DeRose 

Computing  practices: 

p.948  A  proposed  solution  to  the  problem  of  levels  in  error-message  generation./  Kemal  Efe 
p.956  The  partial  metrics  system:  modehng  the  stepwise  refinement  process  using  partial  metrics./ 
Robert  G.    Reynolds 

Research  contributions: 

p.964  The  vocabulary  problem  in  human-system  communications./  G.W.    Furnas.    T.K.    Landauer, 

L.M.    Gomez  and  S.T.    Dumais 
p.972  The  relationship  of  MIS  steering  committees  to  size  of  firm  and  formalization  of  MIS 

planning./  William  J.    Doll  and  Golamreza  Torkzadeh 


Social  science  microcomputer  review 
V.5(4)  Winter  1987 

Symposium  on  the  state  of  the  art  of  social  science  computing: 

p.    V  Introduction./  David  Garson  p.439  Computing  in  sociology:  bringing  back  balance./  Kenneth  E. 

Hinze. 
p.452  Anthropological  computing  in  the  mid-1980s./  Dennis  O'Neil 

p.466  Computer  use  by  psychologists./  Michael  L    Stoloff,  James  V.    Couch  and  Sarah  Riley 
p.476  Instructional  microcomputing  in  economics./  William  P.    Yohe,  Robert  E.    Schenk  and  William 

B.    Walstad 
p.485  Computers  in  political  science./  Stephen  Frantzich  and  Helen  Purkitt 
p.506  Computer  applications  in  public  administration:  increased  capabilit>'  at  lower  cost/  Thomas  C. 

Foss 
p.514  Historians  and  computing,  circa  1987./  James  B.M.    Schick 
p.529  The  evolving  statistics  and  research  process  using  microcomputer  software./  Edwin  H. 

Carpenter 
p.545  Microcomputer  expert  systems./  Carl  Grafton  and  Anne  PermalofT 
p.558  News  and  notes. 

Software  reviews: 

p.581  PSYCOM:  psychology  on  computer.    (Janet  L.    Kottke) 

p.584  START.    (Carole  L.    Ricketts) 

p.585  Multidimensional  personality  evaluation.    (David  Anderson) 

p.585  Professional  personality  invenlor>'.    (Larry  D.    Hilgert) 

p.586  Statistics:  demonstrations  and  tutorials.    (William  F.    Bengston) 

p.588  Probabilit)'  demonstrations  and  tutorials.    (J.A.F.    NichoUs) 
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Book  reviews: 

p.591  The  dossier  society./  Kenneth  C.    Laudon 

p.595  Computing  in  psychology:  an  introduction  to  programming  methods  and  concepts./  James  H. 
Reynolds 


Machine  Readable  Records  bulletin 
vol.  5(3).  Fall  1987 

p.    1  CULDAT  record  formaL 

p.    2  ESRC  seminar  series  on  the  bibhographic  control  of  computer  files. 

p.    2  Bureau  of  Canadian  Archivists  and  descriptive  standards. 


Computational  statistics  &.  data  analysis 
V.6(l)  January  1988 

Section  I:  Methodology 

p.     1  An  algorithm  for  exact  evaluation  of  multivariate  functions  and  their  derivatives  to  any  order./ 

A.S.    Wexler 
p.    7  Fitting  generalized  linear  models  and  their  nonlinear  extensions  with  least-squares  calculations./ 

W.D.    Stirling 
p.  21  Maximum  likelihood  estimation  in  models  with  two  systematic  parts./  W.D.    Stirling 
p.  39  The  sensitivity  of  the  one-sample  and  two-sample  Student  t  statistic./  G.C.    Canavos 

Section  II:  Applications  and  comparative  studies 

p.  47  Efficiencies  in  microcomputer  simulation  experiments./  V.F.    Flack 

Section  III:  Notes,  announcements  and  reviews 

p.  59  A  note  on  the  in  variance  property  of  signal  detectors./  D.R.    Halversen  and  G.L.    Wise 

p.  65  A  review  of  MicroTSP,  a  software  for  econometric  analysis./  A.  ten  Gate  and  J.    Middledorp 

p.  71  Review  of  SPSS  PC+  statistical  package./  M.E.    Vogel 

p.  85  Announcements 


Fall /Winter   1987 


iassisl   quarterly  _    io9 


Scope:  humanities  computing  update 
vol.  5(5),  Sept.- Oct.,  1987 

p.  49  20/20  hindsight  in  computerland. 

p.  50  Software.    Networks. 

p.  51  Courseware.    Software  reviews. 

p.  53  Publications. 

p.  54  Campus  news.    Hardware.    Quotes. 

p.  55  Interactive  learning  technology:  laserdiscs  as  information  bases./  Lajr>-  Katz 

p.  56  Databases. 

p.  57  The  future  of  academic  computing:  faculty  work  stations  at  Indiana  Universit>'./  Polly  A. 

McClure 
p.  58  The  Scholar's  Toolbox:  some  generic  software  tools  and  some  guides  to  their  use./  Patricia 

Galloway 
p.  59  Calendar. 


Communications  of  the  A.C.M. 
V.31(l)  Januan  1988 


Articles: 

p.  10  Parlog86  and  the  dining  logicians./  G.A.    Ringwood 

p.  26  A  view  of  the  origins  and  development  of  Prolog./  J.    Cohen 

p.  38  The  early  years  of  logic  programming./  R.A.    Kowalski 

Computing  practices: 

p.  44  A  fair  Share  scheduler./  J.    Kay  &  P.    Lauder 

p.  56  Anatomy  of  a  compact  user  interface  developmnt  tool./  J.W.    Stott  &  J.E.    Kottemann 

Research  contributions: 

p.  68  Computer  technology  and  jobs:  an  impact  assessment  model./  R.    Weber 
p.  78  Computer  backup  pools,  disaster  recovery,  and  default  risk./  Y.    Kahane,  S.    Neumann  &  C.S. 
Tapiero 
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Data  user  news  from  the  Bureau  of  the  Census 
vol.  22(12),  Dec.  1987 


1  One  million  plus:  San  Diego  and  Dallas  join  the  ranks. 

2  Latest  population  and  income  estimates  for  places  and  counties.    Countv'  estimates  of 

households  in  1985. 

3  Census  Bureau  will  not  adjust  '90  census  data.    Where  do  the  questions  come  from? 

4  Twc^thirds  of  U.S.  workers  have  pension  coverage. 

5  Meeting  the  need  for  statistics  on  industrial  automation. 

5  Computer  file  profiles  scientists  and  engineers.    Previewing  the  '87  economic  censuses. 

7  College  grad  income  nearly  double  that  of  high  school  grads.    Getting  the  goods  from  GPO. 

9  Only  five  states  have  more  males  than  females.    'Mr.    Housing'  retires. 

10  More  Census  Bureau  staff  papers  at  the  American  Statistical  Association  conference. 

11  Over  one-half  of  state  expenses  are  for  education  and  welfare. 

12  Data  user  news  annual  index:  January-December  1987. 


Electoral  studies 
V.6(3)  December  1987 

p.l95  General  election  forecasting  in  Britain:  a  comparison  of  three  simple  methods./  A.    Mughan 
p.209  The  impact  of  electoral  formulae  on  the  creation  of  majority  govenmients./  A.    Blais  &  R.K.. 

Carty 
p.219  The  Icelandic  pariiamentary  election  of  1987./  O.T.    Hardarson  &  G.H.    Kristinsson 
p.235  The  Maltese  general  election  of  1987./  S.    Howe 
p.249  The  Fiji  general  election  of  1987./  V.    Lai 

p.263  Notes  on  recent  elections:  the  British  general  election  of  1987./  P.    Cozens  &  K.    Swaddle 
p.267  The  Italian  general  election  of  1987./  D.    Hine 
p.271  The  Finnish  general  election  of  1987./  S.    Berglund 
p.273  The  South  Africa  House  of  Assembly  election  of  1987./  A.  du  Toit 
p.279  The  Malaysian  general  election  of  1986./  A.M.    Hanafiah 
p.286  The  Gambian  presidential  and  pariiamentary  elections  of  1987./  J.A.    Wiseman 
p.289  Book  note 
p.292  Guide  to  journal  articles 
p.299  National  elections 
p. 303  Annual  index 
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ICPSR  Bulletin 

(Inter- University  Consortium  for  Political  and  Social  Research) 

Dec.  1987 

p.    1  The  general  social  survey:  update  on  a  national  data  resource  for  the  social  sciences.  /  Duane 

F.    Alwin 
p.    4  Additions  to  holdings: 

General  social  surveys,  1972-1987:  cumulative  file. 

Current  population  survey:  annual  demographic  file,  1987. 

Population  estimates  by  county  with  components  of  change,  1985. 

Net  migration  of  the  population  of  the  United  Sutes  by  age,  race  and  sex,  1970-1980. 

Centre-periphery  structures  in  Europe:  1880-1978. 

ABC  News/Washington  Post  exit  poll,  1986. 

ABC  News  DanilofT  freedom  poll,  September,  1986. 

ABC  News/Washington  Post  summit  poll,  October  1986. 

ABC  News  Iran  poll,  November  1986. 

ABC  News  Reagan  press  conference  poll,  November  1986. 

National  health  interview  survey,  1984-1986:  longitudinal  study  of  aging,  70  years  and  over. 

Consolidated  federal  funds  report  (CCRF),  fiscal  year  1986. 

County  population  estimates  (experimental)  by  age,  sex,  and  race:  1980,  1982  and  1984. 

Census  of  population  and  housing,  1980  [United  States]:  summary  tape  file  3d  congressional 

district-level  extract 

Recidivism  among  young  parolees:  a  study  of  inmates  released  from  prison  in  22  states,  1978. 

Effects  of  prison  versus  probation  in  California,  1980-1982. 

Prosecution  of  felony  anests,  1982:  Sl    Louis. 

Prosecution  of  felony  arrests,  1982:  Portland  Oregon  and  Washington,  DC 

National  evaluation  of  the  community  anti-crime  program,  1979-1981. 

Survey  of  consumer  attitudes  and  behaviour,  November  1976. 

Survey  of  consimier  attitudes  and  behaviour,  Febniary  1977. 

Survey  of  consimier  attitudes  and  behaviour.  May  1977. 

Survey  of  consimier  attitudes  and  behaviour,  August  1977. 

Survey  of  consumer  attitudes  and  behaviour,  November  1977. 

Party  elites  in  the  United  States,  1974:  Democratic  mid-term  conference  delegates  and 

Sanford  Commission  members. 

Party  elites  in  the  United  States,  1975:  Democratic  national  convention  delegates. 

Survey  of  city  council  members  in  large  American  cities,  1982. 

National  health  and  nutrition  examination  survey  II:  1976-1980:  physician  examination,  ages  6 

mos  -  74  yrs. 

Aging  in  the  eighties:  America  in  transiton,  1981. 
p.  13  Notes  and  errata 

p.  15  Serial  data  collections:  continuous  updates, 
p.  18  Other  announcements:  1988  ICPSR  summer  program  tentative  schedule) 


Fall/Winter  1987 


112  —  iassist   quarterly 


ESRC  data  archive  bulletin 
N.39  Januan  1988 


News: 

p.  1  Editor's  note 

p.  1  A  new  charging  policy  for  archive  services 

p.  2  Chorley  report:  the  government's  response 

p.  2  British  election  survey  data  distribution 

p.  3  GHS  update:  GHS  Newsletter  No,4 

p.  4  Rural  areas  database 

p.  4  New  staff 

New  acquisitions: 

p.    5  Selected  list  of  new  acqtiisitions 

p.    7  Updates  to  serials  holdings 

Research  organisations,  data  institutions  and  foreign  archives: 

p.     8  The  proposed  University  of  Essex  Institute  for  Social  Science  Research 

p.    9  ICPSR  News 

p.    9  Research  report:  computer  based  index  to  the  Irish  Ordnance  Survey  Memoirs 

Software  bulletin: 

Notes: 

p.    9  Customer-selected  results  from  the  1991  population  census  using  IICBS  and  highly  secure 

technology./  D.    Rhind  &  M.    Higgins 
p.  11  SIGCHI  Bulletin 
p.  11  Statistical  news 
p.  11  Government  computing 
p.  11  Area 

p.  12  ZUMA  newsletter 
p.  12  BURISA 
p.  12  ESRC  newsletter 
p.  12  Stirvey  methodology 
p.  12  Population  trends 
p.  12  European  political  data  newsletter 
p.  12  Employment  gazette 

p.  13  Survey  methods  for  social  researchers:  course  armouncemeni 
p.  13  Assignation 

p.  13  Essex  papers  in  politics  and  government 
p.  13  Software  for  statistical  and  survey  analysis  register 
p.  13  SGCSA  prize  for  computing  paper 
p.  13  Luxemboiu'g  income  study  summer  workshop 
p.  14  Forthcoming  events 
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Books: 

p.  18  A  guide  to  SPSS/PC+./  Neil  Frude 

The  SPSS  guide  to  data  analysis  instructor's  manual./  M.    Norusis 
p.  19  On  the  record.    Sur\'eillance,  computers  &  privacy  -  the  inside  story./  D.    Campell  &  S. 

Connor. 

Book  notes: 

p.  19  British  social  attitudes,  ed.  by  Jowell,  Witherspoon  &.  Brooks 

p.  19  Elections  and  voters,  by  M.    Harrop  and  W.    Miller 

p.  20  Ideology,  strategy  and  party  change:  spatial  analyses  of  post-war  election  programmes  in  19 

democracies,  ed.  by  Budge,  Robenson  and  Hear) 
p.  20  Online  information  retnevaJ  in  pracuce,  ed.  by  L    Dorrington 
p.  20  Data  collection  in  developing  counmes,  by  D^    Casley  and  D.    Lury 

Appendices: 

p.  21  Recent  additions  to  the  ICPSR  archive  a:  Michigan 
p.  22  Datasets  acquired  since  the  publication  of  B'olieun  38 


Conimunications  of  the  ACM 
V.31  N.2  Februan  1988 


Articles: 

p.UO  The  an  and  science  of  visualizing  data./  K.A.    Frenkel 
p.l22  Coons  award  lecture./  D.P.    Greenberg 
p.l30  Ray  traang  Jell-0  brand  gelatin./  P.S.    Heckben 
p.l36  Pattern  formation  and  chaos  in  networks./  C.A.    Pickover 
p.l52  Connectionist  expen  systems./  S.l.    Gallant 

p.170  Computing  with  structured  connectionist  networks./  J.A.    Feldman,  M.A.    Fant)',  N.H.    Goddard 
&  K.J.    Lynne 

Computing  practices: 

p. 188  NPATH:  a  measure  of  execution  path  complexit>'  and  its  applications./  B.A.    Nejmeh 
p. 202  Parallel  text  search  methods./  G.Salion  &.  C.    Buckley 

Research  contributiom: 

p.216  Binomial  random  vanaie  generation.    \'.    Ivachitvichyanukul  &  B.W.    Schmeiser 

Departments: 

p.  99  Authors 

p.lOO  Fdiiori.il 

p.  101   ,'\CM  vKc-prcsulcni's  Iciicr 

p.  104  l\>lii.'\   discussion 
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p.l06  ACM  press  books 

p.224  Calendar  of  events 

p.228  Calls  for  papers 

p.230  ACM  news 

p.232  General  news  and  notes 

p.233  Standards 

p.234  Offiaal  ACM  news 

p.237  Corrigendum 

p.238  ACM  open  business/technical  meetings  at  CSC 


European  political  data  newsletter 
No.65  December  1987 

p.  4  Editorial  note 

Data  section- 

p.  5  Complete  censuses  and  samples./  L.    Kish  &  V.    Verma 

p.  18  Which  cotmtries  will  follow  the  Scandinavian  lead  in  taking  a  register-based  census  of 
population?/  P.    Redfem 

Archive  nens: 

p.  27  The  Soviet  Union:  the  all-imion  sociological  data  bank  (AUSDB). 

p.  29  Himgmfo:  Hungarian  social  sciences  and  humanities. 

New  publications: 

p.  30  Recent  additions  to  the  series  of  Essex  papers  in  politics  and  govemmenL 

p.  34  New  directory  of  political  scientists  in  Britain 

Computer  section- 

p.  35  Information  systems  for  urban  and  regional  planning.    An  abstract  of  a  SORSA  paper  prepared 

in  coordination  with  UNESCO./  J.P.  van  Est,  U.    Maack.  S.W.    Witiuk  &  F.R.    Broome 
p.  67  Developing  a  GIS  for  the  European  Community  with  the  CORINE  programme 


NSD  brukennelding 

(Norwegian  Social  Science  Data  Service) 

1988:1,  Februarj'  1988 

[Data  file  announcements.] 

p.    2  Health  survey  1985.  [in  Norwegian] 
Youth  survey  1985.  [in  Norwegian] 
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p.  3  Biographies  of  Storting  delegates  1945-1985.  [in  Norwegian] 

p.  4  ICPSR  Slimmer  School  in  Ann  Arbor,  [in  Norwegian] 

p.  6  Census  of  population  and  housing  1990.  [in  Norwegian] 

p.  9  Qualit\-  of  life  bibliography,  [in  Norwegian] 

Additions  to  the  Kommunedatabas.  [in  Norwegian] 

p.  12  Survey  data  files  in  the  NSD  collections,  [in  Norwegian] 

p.  13  The  Norwegian  values  survey,  [in  Norwegian] 

Books: 

p.  13  Stat,  nasjon,  klasse.    Essays  i  polidsk  sosiologi./  Stein  Rokkan.  1987  [in  Norwegian] 

p.  14  Arbeidsmarkedsprognoser./  J.E.    Askildsen  &  E.    Waemess,  1987.  [in  Norwegian] 


NSD  brukermelding 

(Norwegian  Social  Science  Data  Service) 

1988:2,  March  1988 

p.    2  Data  on  municipal  elections,  [in  Norwegian] 
p.    3  Municipal  census  data  for  1985.  [in  Norwegian] 

Essex  Summer  School,  [in  Norwegian] 
p.    5  IASSIST  88  [in  Norwegian] 

p.    6  Documentation  of  NAVF  and  NORAS  research  projects,  [in  Norwegian] 
p.  10  Cooperative  European  teaching  acdvides.  [in  Norwegian] 
p.  11  NSD  contract  with  the  European  Council,  [in  Norwegian] 
p.  11  Research  in  mass  communicauon.  [in  Norwegian] 
p.  12  Research  in  development  [in  Norwegian] 

New  codebooks  from  ICPSR.  [in  Norwegian] 
p.  13  Computers  in  the  schools,  [in  Norwegian] 
p.  14  Books:  [in  Norwegian] 

Nesset  kommune  gjennom  150  aar./  M.    Angvik. 

Eldreboelgen  -  en  politisk  utfordring./  K  Hesselberg 

Norwegian  local  tax  policy./  T.    Hansen. 


Bits  &  bytes  review 
vol.  1(7),  June  1987 


Data  base  reviews: 

p.    1  From  Homer  to  Hesychius  -  the  Thesaurus  Linguae  Graecae  Project    John  J.    Hughes. 

p.    7  From  Torah  to  Talmud  to  today  -  Responsa  Project/global  Jewish  database.    John  J.    Hughes. 
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p.  13  Coming  events 
p.  14  What's  news 
p.  17  Fine  print 
p.  20  In  future  issues. 


SSD  kontakt 

(Swedish  Social  Science  Data  Service) 

1-2/1987 

p.     1  International  cooperation  amongst  academic  data  archives,  [in  Swedish] 

p.    4  Report  on  the  IASSIST  conference  in  Vancouver,  May  19-22,  1987.  [in  Swedish] 

p.    7  The  1988  IASSIST  conference,  [in  Swedish] 

p.    8  The  first  CESSDA  Expen  Seminar,  Odense,  SepL  16-29,  1987.  [in  Swedish] 

p.  10  ICPSR  Summer  School  1988.  [in  Swedish] 

p.  11  Decline  pf  partisanship  in  western  industrialized  nations./  Howard  L.    Reiter. 

p.  14  Meeting  of  the  SSD  users'  group,  September  24,  1987.  [in  Swedish] 

p.  17  IMF  data  on  magnetic  tape./  Boo  Sjoeoe  [m  Swedish] 

p.  21  The  Euro-barometer  series,  [in  Swedish] 

p.  26  Class  structure  in  five  countries,  [in  Swedish] 

p.  27  'Mini-catalogue'  of  SSD-studies. 

p.  38  New  foreign  surveys  in  SSD's  collection,  [in  Swedish] 

p.  39  Recently  processed  Swedish  data  files,  [in  Swedish] 

SSD  0103  Demographic  and  economic  time-series  for  Swedish  municipalities. 

SSD  0214  Survey  on  democracy  versus  bureaucracy  in  foreign  and  domestic  policy. 

SSD  0208  Referendum  study  1957. 

SSD  0203  Referendum  study  1980. 

SSD  0209  Archive  of  popular  movements,  1881-1945. 

SSD  0212  Comparative  socio-political  attitudes. 

SSD  0210  Socio-political  attitudes  1975. 

SSD  0211  Socio-political  attitudes  1979. 
p.  47  SSD  class  I  data  files,  [in  Swedish] 
p.  49  English  summary. 


DD.'\  nyt 

(Danish  Data  .'Archives)  nr.  42,  summer  1987 

p.     3  Ediiona.;. 

p.     5  Ainendmenis  lo  the  Danish  Data  Protection  Act,  June  1987./  Per  Nielsen  [in  Danish] 

p.  2"  Tne  Max  Pianck  Histoncal  Instirute./  Hans  J.    Marker,  [in  Danish] 

p.  35  Newly  processed  data  files:  [in  Danish] 
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DDA-0658  Danish  election  studies,  continuitv-  file  1971-1981. 

DDA-1052  OBSERVA  political  index  polls,  1984. 

DDA-1053-1064  OBSERVA  political  index  polls,  Jan.-Dec.  1984. 

DDA-1069  Experienced  workers  and  new  technology,  1983. 
p.  49  New  data  files:  [in  Danish] 

DDA-1237  Danish  leisure  study,  1987:  background  variables. 

DDA-1242  Danish  leisure  study,  1987:  adult  questionnaire. 

DDA-1243  Danish  leisure  study,  1987:  adult  telephone  interviews. 

DDA-1244  Danish  leisure  study,  1987:  daily  time  use. 

DDA-1245  Danish  leisure  study,  1987:  child  questionnaire. 

DDA-1246  Danish  leisure  study,  1987:  child  interviews. 

DDA-1347  Danish  omnibus  survey,  January  1987. 

DDA-1348  Danish  omnibus  survey,  April  1987. 

DDA-1349  Social  networks  and  care  of  old  people,  1987-1988. 

DDA-1350  Maltieamieni  of  old  people  -  failing  caie,  1987-1988 

.  DDA-1351  Development  of  trade  unions  in  Europe  after  World  War  II. 

DDA-1347  Danish  omnibus  survey,  January  1987. 
p.  57  English  summary, 
p.  73  Order  form,  [in  Danish] 


Humanistiske  data 

(Norwegian  Computing  Centre  for  the  Humanities) 

3-87 


4  Editorial. 

6  BBC  advanced  interactive  video  and  the  Domesday  discs./  Phyllis  S.    Gove 
15  The  North  West  Educational  Computing  Project/  Ian  Robertson  &  Mike  Picciotto 
22  Recording  the  Italian  cultural  heritage./  Ernesto  Bartolozzi 
33  A  CD-ROM  based  geographic  information  system./  Erling  Maartmaim-Moe 
42  Introducing  a  new  tool  in  foreign  language  teaching./  Bjom  Dommarsnes  [in  Norwegian] 
50  Dialect  expert/  Torbjom  Nordgard  [in  Norwegian] 
64  Current  methods  for  automatic  morphological  analysis  and  possible  applications  in 

word-processing  systems  of  the  future./  Tove  Fjeldvig  [in  Norwegian] 
82  'Humanistic  informatics'  in  Denmark./  M.B.    Laisen  [in  Norwegian] 


Reports 

p.  89  Interactive  video  in  foreign  language  teaching./  R.O.    Andersen  [in  Norwegian] 

p.  99  RD-BIB./  Eli  Schilbred  [in  Norwegian] 

p.  107  Multi-Lingual  Scholar  (TM)  version  3.0./  Torrey  Seland  [in  Norwegian] 

p.  115  'Humanities  Days'  1987./  Kristin  Nat\'ig  [in  Norwegian]  p.  117  News  [in  Norwegian] 

p.  124  Stimmary 
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ZA  Information 

(Zentralarchiv  fuer  Empirische  Sozialforschung) 

nr.  21,  November  1987 

p.  4  Editorial  [in  German] 

News  from  the  Archive 

p.  5  Newly  acquired  data  files:  [in  German] 

1536  German  election  study  1987:  continuous  monitoring 

1537  German  election  study  1987:  panel  study 
1550  Crime  and  fear  of  criminality  among  students. 

1593  Survey  on  foreign  aid  and  the  politics  of  development,  1979. 

1594  Stu-vey  on  foreign  aid  and  the  politics  of  development,  1981. 

1595  Women,  men,  and  computers. 
1595  Youth,  education,  and  occupation. 

1597  Dialogue  2:  publicity  versus  personal  privacy. 

1599  Survey  of  foreigners  1982:  wave  1:  household  heads. 

1600  Survey  of  foreigners  1982:  wave  2:  household  members. 

1601  General  social  survey  ALLBUS  1984. 

1602  Attitudes  to  technology,  1986. 

1603  Development  of  drug  use  among  youth. 

1604  Transfer  survey. 

1605  Changes  in  traffic  fiow  patterns  1976. 

1606  Changes  in  traffic  fiow  patterns  1982. 
1609  Growth  factors  in  leisure  time  to  1995. 
1611  Continuing  on  to  post-secondan'  education. 

1611  The  progression  of  working  class  children  to  post-secondary  education. 

1613  Marriage  and  the  family.    (1986) 
p.    7  German  election  study  1985:  data  file  and  codebook  now  available,  [in  German] 
p.  11  The  International  Social  Survey  Program  (ISSP).  [in  German] 
p.  14  Expansion  of  the  ALLBUS  data  collection  and  documentation,  [in  German] 

Research  notes: 

p.  18  Research  on  opinion  research./  F.    Neidhardt  [in  German] 

p.  29  Attitudes  on  downtown  Hamburg  :  further  analysis  using  correspondence  analysis./  J.    Blasius 

[in  German] 
p.  52  A  LISREL-test  of  multivariate  differences  in  non  homogeneous  variances  and  covariances.    An 

example  drawn  from  the  foreign  worker  items  in  the  1980  and  1984  ALLBUSes./  S. 

Kuehnel  [in  German] 
p.  59  Factual  questions  and  the  reliability  of  responses.    Sources  of  enor  in  the  data  from  two 

epidemiological  studies  -  the  Frankfurt  mail  survey./  E.    Durin,  R.    Menzel,  W.    Bauhoefer. 

[in  German] 
p.  70  A  second  round  of  discussion  of  problems  in  the  use  of  categorical  data  from  3  wave  panels: 

remarks  on  Jagodzinski's  reply./  R.    Langeheine  [in  German] 
p.  77  Reply  to  Langeheine.  [in  German] 
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News  and  notes: 

p.  81  Meeting  of  archival  experts  in  the  Danish  archives,  [in  Gennan] 

p.  83  Book  review:  Praxis  der  Wahlforschung./  Kort-Krieger  &  MundL  [in  German] 

p.  85  Data  protection  and  data  needs  in  social  research.    A  meeting  of  the  Association  of  German 

Sociologists,  [in  German] 
p.  87  A  position  on  the  development  of  the  data  protection  legislation  for  Nordrhein-Westgalen./ 

E.K.    Scheuch  [in  German] 
p.  92  ZA  spring  seminar:  the  analysis  of  social  networks.    Feb.  2  -  Mar.  11,  1988.  [in  Gennan] 


ZUMA  Nachrichten 

(Zentnim  fuer  Umfragen,  Methoden  und  Analysen  e.V.) 

nr.  21,  November  1987 

p.  iii  In  my  view,  [in  German] 

Essays 

p.    1  Are  the  social  sciences  physical  sciences?  [in  Gennan] 
p.  20  Center  for  Microdata  -  a  new  division  of  ZUMA.  [in  German] 
p.  31  Center  for  Social  Indicators  Research  -  a  new  division  of  ZUMA.  [in  German] 
p.  43  The  weighting  of  sample  surveys:  is  representativeness  possible?  [in  German] 
p.  59  Handling  missing  values  in  logical  expressions  in  SAS  and  SPSS:  beware  of  unexpected  results, 
[in  German] 

Project  reports 

p.  64  Egocentric  networks:  different  research  instruments  -  difTerent  results?  [in  German] 

p.  78  Politicization  and  depoliticization  of  welfare  expectations,  [in  German] 

Publications 

p.  92  ZUMA  publications,  [in  German] 

p.  92  Publications  based  on  ZUMA  meetings,  [in  German] 

Report  on  ZUMA  activities 

p.  93  Conference  on  self-administered  and  telephone  interview  methods,  April  27-30,  1987.  [in 

German] 
p.  94  ZUMA  research:  Current  panel  research  problems,  May  4-7,  1987.  [in  German] 
p.  96  ZUMA  workshop:  Comparative  career  mobility-.  May  18-21,  1987.  in  Gennan] 
p.  96  ZUMA  workshop:  GAUSS,  October  19-23,  1987.  [in  German] 

Coming  ZUMA  activities  for  1988  [in  German] 

p.  97  ZUMA  workshop:  The  theon.-  of  dynamic  systems,  March  14-17  and  Mar.  21-24,  1988. 
p.  98  ZUMA  workshop:  CLUSTAN,  April  12-14,  1988. 

p.  99  ZUMA  workshop:  Software  for  numerical  simulation  and  model  optimization  on  the  PC,  May 
2-5,  1988. 
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p.  100  ZUMA  workshop:  The  analysis  of  categorical  data,  June  27-July  1,  1988. 

p.  101  ZUMA  symposium:  Social  desirability  -  situation  and  context,  July  19-21,  1988 

p.  101  ZUMA  workshop:  Analysis  of  event  data,  July  1988. 

p.  102  ZUMA  workshop:  Causal  modelling  with  EQS.  July  1988. 

p.  103  Visitors  to  ZUMA.  [in  German] 

p.  105  News:  SOZIALWISSENSCHAFTEN-BUS.  [in  German] 


Historical  social  research 
nr.  42/43,  April/July  1987 

Special  issue:  the  multivariate  analysis  of  nominal  and  ordinal  data. 

p.    3  Editorial 

p.    5  Educational  opportimit\-  and  social  mobility  in  the  19th  century  public  sector:  an  overview./  P. 

Lundgreen  [in  German] 
p.    7  Multivariate  analysis  of  nominal  and  ordinal  scale  data  based  on  multidimensional  contingency 

tables./  Juergen  Sensch  [in  German] 
p.  172  Correspondence  analysis  -  a  multivariate  procedure  for  the  analysis  of  qualitative  data./  Joerg 

Blasius  [in  German] 
p.  190  Book  review:  Die  verlorenen  Welten.    Alltagsbewaeltigung  durch  unsere  Vorfahren  -  und 

weshalb  wir  uns  heme  so  schwer  damit  tun.../  Arthur  E.    Imhof.  [in  German] 
p.  191  Data  news  [in  German] 
p.  206  Quantum  information  [in  German] 
p.  214  Forthcoming  conferences  [in  German] 
p.  218  Past  conferences  [in  German] 


Historical  social  research 
or.  44,  October  1987 


Special  issue:  history  of  employment  structure  in  20th  century  Europe. 

p.    3  Editorial 

p.    5  The  regional  stiucttire  of  employment  in  Germany,  1895-1970./  H.    Kaelble  &  R.    Hohls 

p.  36  A  centuPt'  of  change:  trends  in  the  composition  of  the  Italian  labour  force,  1881-1981./  V. 

Zamagni 
p.  98  The  development  of  the  service  sector  in  Germany  and  the  Netherlands  -  a  comparison./  H. 

van  Dijk 
p.  121  A  comparative  historical  study  of  long-term  change  in  the  the  earnings  structtu-e  in  Germany./ 

R.    Stockman  [in  German] 
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Current  research 

p.  127  The  handbook  of  members  of  the  Prussian  Chamber  of  Deputies  1867-1918. 

TUSTEP  in  biographical  research.  /  T.    Kuehne  [in  Gennan]  p 

[in  German] 
p.  148  Forthcoming  conferences  [in  German] 
p.  153  Past  conferences  [in  German] 


The  use  of 
141  Quanttun  information 


NOTICE 

If  your  archive  has  a  written  collection 

development  policy  for  machine- 
readable  data  files,  please  send  a  copy 
to,  or  contact  Dan  Tsang,  Main  Library, 

University  of  California, 

PO  Box  19557,  Irvine  CA  92713,  USA. 

Tel:  (714)  856-4978 

Email:  dtsang@orion.cf.uci.edu 

He  is  researching  such  policies  at 
academic  libraries. 
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Australian  election  survey,  1987 


Background 

This  survey  of  1,825  respondents  is  the  first  purpose-designed  national  election  study  undertaken  in 
Australia.    It  has  two  goals.    The  first  is  to  continue  the  broad  line  of  inquiry  established  by  the 
1967  and  1979  Australian  National  Political  Attitudes  Surveys  into  the  long-term  character  of  national 
political  attitudes  and  behaviour.    The  second,  and  more  novel,  is  to  focus  more  clearly  on 
short-term  forces,  like  issues  and  leaders,  specific  to  the  1987  election.    The  principal  investigators 
are  Roger  Jones  of  the  Austrjilian  National  Universit}-,  Ian  McAllister  of  the  University  of  New 
South  Wales  and  Anthony  Mughan  formerly  of  the  Australian  National  Universit>'  and  now  of  the 
Ohio  State  University.    The  survey  was  funded  by  the  Australian  National  Universit>'  and  the 
Universit>'  of  New  South  Wales. 

The  Survey 

A  systematic  random  sample  of  3,061  respondents  was  drawn  from  the  electoral  rolls  at  the 
Australian  Electoral  Commission  two  weeks  before  the  election  of  11  July  1987.    The  self-completion 
(mail  out,  mail  back)  questionnaire  comprises  195  variables  divided  into  six  broad  sections:  The 
Federal  Election,  Political  Leaders,  Election  Issues,  Social  and  Political  Goals,  Education  and  Work 
and  Personal  Backgroimd. 

The  Federal  Election  section  contains  questions  on  concern  about  the  election  outcome,  the  influence 
of  the  media,  party  identification,  vote  in  the  1984  and  1987  elections,  the  timing  of  the  voting 
decision  and  the  importance  to  the  vote  of  a  number  of  issues. 

The  Political  Leaders  section  enquiries  about  feelings  towards  the  pariy  leaders  indicated  by  ratings 
from  0  to  10,  about  their  leadership  qualities  and  their  effectiveness  as  Prime  Minister.    It  also 
assesses  perceptions  of  the  unity,  moderation  and  class  appeal  of  the  Labour,  Liberal  and  National 
parties. 

The  Election  Issues  section  is  the  longest,  focussing  on  economic  voting,  privatisation,  government 
spending,  trade  union  and  big  business  power,  uranium  mining,  law  and  order,  Asian  migration, 
marijuana,  abortion,  aborigines,  testing  for  AIDS,  censorship,  homosexualit}',  the  environment  and 
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opportunities  for  women. 

The  Social  and  Political  Goals  section  contains  Inglehart's  postmaterialism  questions,  trust  in 
government  and  modes  of  political  participation  questions. 

Education  looks  at  number  of  years  of  schooling,  state  or  Catholic  school  and  qualifications  obtained. 
Work  asks  about  employment  status,  occupation,  public  or  private  employment,  supervisory 
responsibilities,  trade  union  membership  and  subjective  class. 

The  personal  background  questions  include  sex,  place  and  length  of  residence,  age,  place  of  birth, 
date  of  arrival  in  Australia,  father's  occupation  and  party  preference,  marital  status  and  children, 
spouse's  occupation,  trade  union  membership  and  party  pareference  and  the  respondent's  religion  and 
religious  practice. 

Availability 

The  data  file  and  codebook  are  now  available  from  The  Social  Science  Data  Archives,  The  Australian 
National  University,  GPO  Box  4,  Canberra,  ACT  2601,  Australia.    Telephone  (062)  494400. 
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Career  opportunities 


Job  Opening 

Position  Title:        Associate  Director,  Government  Data  Center 
Harvard  University 

The  Govemmeni  Data  Center  (GDC)  is  the  home  of  the  Government  Department's  microcomputer 
lab  and  Harvard's  primai>'  computerized  social  science  data  and  codebook  archive.    Since  the  director 
of  the  GDC  is  a  member  of  the  Government  Department  faculty-,  the  associate  director  is  the  top 
stafT  position  at  the  GDC  and  is  highly  visible  within  the  Harvard  community.    The  occupant  of  this 
position  provides  technical  advice  and  guidance  to  faculty,  staff,  and  students  on  computational 
problems  related  to  coursework,  research,  and  administration.    Serves  as  consultant  to  other  related 
social  science  departments,  and  acts  as  the  department's  liaison  with  other  departments  and  units  of 
the  University  on  computer-related  issues.    Supervises  the  acquisition  and  cataloging  of  data  sets  and 
codebooks,  and  assists  in  developing  and  implementing  the  GDC's  financial  and  other  policies.    With 
his  or  her  staff,  the  associate  director  has  considerable  fiexibility.    Depending  on  skills  and  interests, 
the  associate  director  is  welcome  to  choose  special  projects  to  work  on;  these  might  include:  writing 
computer-aided  instruction  software,  automating  GDC  operations,  teaching  mini-courses  on  computer 
software  and  hardware,  assisting  instructors  in  classroom  teaching  related  to  computers,  or  writing 
documentation.    Other  ideas  and  special  projects  may  be  developed  by  the  new  associate  director. 

Bachelor's  degree  required:  advanced  degree  preferred.    Applicants  should  have  working  knowledge  of 
IBM  microcomputers  and  associated  software.    Experience  with  word  processing  and  statistical 
software  is  preferred.    Familiarity  with  one  or  more  progranuning  languages  (BASIC,  Fortran,  Pascal, 
etc.)  and  mainframe  operating  systems  are  helpful.    Verbal,  administrative,  and  interpersonal  skills  are 
essential. 

Salary  may  be  in  low  to  high  $30's,  depending  on  qualifications.    Interviews  begin  immediately. 
Please  send  a  vitae  with  references  as  soon  as  possible  to: 

Professor  Gary  King 

Department  of  Government  Phone:  517-495-2027 

Harvard  University  BITNET:  gmkHarvtmxw 

Littauer  Center  M37  Internet:  gmkWjhl2.harvard.edu 

Cambridge,  MA  02138  UUCPnet:  gmkWjhl2 


Fall/Winter   1987 


iassisl   quarterly 


-   125 


Course  offering: 


Management  of  computer-readable  files 


The  University  of  British  Columbia  School  of  Library.  Archival  and  Information  Studies  is  offering  the 
following  Extra  Sessional  course  in  the  Summer  Session.  Registration  is  from  March  12  -  June  28. 


Course: 

Instructor: 

Objectives: 


LIBR  65  IB  -  Management  of  computer-readable  files 

Laine  G.M.  Ruus 

to  acquire  a  basic  working  knowledge  of  how  data  are  stored  in  computer-readable 

form,  and  how  data  should  be  stored. 

to  acquire  experience  w  ith  the  management,  documentation  and  description  of  data 

in  computer-readable  form, 

to  acquire  an  appreciation  of  how  data  are  used,  to  gain  an  appreciation  of  the 

technical  and  ethical  problems  surrounding  the  long-term  storage  and  dissemination 

of  data  files 


Content: 

Basic  readings: 
Method: 

Prerequisite: 
Evaluation: 

Course  credit: 


major  sources  of  data  files 
'dirty  data" 

physical  storage  media 
access  versus  acquisition 


file  structures  and  documentation 
elementary  statistical  analysis 
ethical  considerations:  citation,  copyright,  etc. 
institutional  structures 


students  are  expected  to  select  their  own  reading  material  from  an  on-line  SPIRES 
database 

in  addition  to  lecture  material,  students  will  be  assigned  a  data  file  which  they  will 
take  through  standard  processing  steps,  generate  appropriate  documentation  and 
cataloguing  records,  and  finally  do  basic  statistical  analysis 


will  be  based  on  hands  on  work  with  an  assigned  data  file  and  a  take-home 
examination 

1.5  units. 


For  further  information,  contact: 

School  of  Library.  Archival  and  Information  Studies 
University  of  British  Columbia 
2075  Wesbrook  Mall 
Vancouver,  B.C.  V6T  1W5 
Phone:  (604)  228-2404. 
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University  of  Alberta 

Summer  Institute  on  the 

Quantitative  Analysis  of  Social  Data 

June  20  -  30,  1988 


Level  I  -  Beginning  Statistics 

•  basic  descriptive  statistics 

•  statistical  inference 

•  regression  analysis 

Harvey  Krahn 
University  of  Alberta 


Data  Analysis  Workshop 


Level  II  -  Applied  Regression  Analysis 

•  build  and  test  linear  models 

•  interpret  regression  diagnostics 

•  logit  analysis 

Bob  Arnold 
Queen's  University 

Joint  Sessions 

Computing  &  Data  Analysis 


•  discussion  of  data  analysis  problems 

•  stages  of  survey  data  analysis 

•  data  interpretation 

Bill  Avison 

University  of  Western  Ontario 


•  preparing  data  for  statistical  packages 

•  using  packages  to  clean  data 

•  labs  using  SPSSx  and  SAS 

Chuck  Humphrey  &  Wayne  Watson 
University  of  Alberta 


Sponsors:  Computing  Services  and  the  Population  Research  Laboratory 

Cost:  $650  per  person  (includes  tuition,  course  materials  and  computing  time) 

Further  information:  write  Ilze  Hobin,  Department  of  Sociology,  University  of  Alberta, 
Edmonton,  Alberta  T6G  2H4  or  phone  (403)  432-4659. 
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IASSIST88 


PRELIMINARY  PROGRAM 

IASSIST  '88 

Public  Data:  Use  it  or  Lose  it 

Thursday,  Ma\-  26,  to  Sunday,  May  29,  1988 

Dupont  Plaza  Hotel 

Washington,  DC 

USA 


WEDNESDAY 

4:00  p.m.  to  8:00  p.m. 
REGISTRATION 

DuPont  Plaza  Hotel,  IASSIST  Headquarters  Suite 
THURSDAY,  MAY  26 

8:00  a.m.  to  5:00  p.m. 

REGISTRATION 

Lobby,  DuPont  Plaza  Hotel 

Morning  sessions,  8:30  a.m.  to  12:15  p.m. 

Plenary  session 

Session  1:  View  from  the  National  Statistical  Agencies 

Judith  Rowe,  Princeton  University,  Chair 

William  P.    Bulz,  Associate  Director,  Demographic  Fields 
Area,  U.S.    Bureau  of  the  Census 

Denis  Desjardins,  Director  General,  Marketing  and 
Information  Services,  Statistics  Canada 

Edmund  Rapaport,  Assistant  Director,  Statistics  Sweden 
Concurrent  Sessions 
Session  2- A:  The  Preservation  and  Use  of  Empirical  Economic  Research  Data 

Dan  Newlon,  National  Science  Foundation 

Carolyn  Geda,  ICPSR 
Session  2-B:  Educating  the  Data  User:  The  Role  of  Bibliographic  Instruction 
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Bliss  B.    Siman,  Baruch  College,  City  University  of  New  York,  Chair 
Rona  Ostrow,  Baruch  College,  City  University  of  New  York 
Kristin  McDonough,  Baruch  College,  City  University  of  New  York 
Eleanor  Langstaff,  Baruch  College,  City  University  of  New  York 
Bobbie  Pollard,  Baruch  College,  City  University  of  New  York 
Ida  Lowe,  Baruch  College,  City  University  of  New  York 
Session  2-C:  Digital  Cartographic  Data 

Randle  W.    Olsen,  US  Geological  Survey 

Robert  LaMacchia,  Bureau  of  the  Census,  "TIGER  Files: 

Digital  Canography  for  the  1990  Census." 
Afternoon  Concurrent  sessions,  1:30  p.m.  to  5:00 
Session  3- A:  Data  Analysis  Using  Microcomputers 

Dianne  Crispell,  American  Demographics  Magazine, 

"The  Use  of  Microcomputers  for  Demographic  Analysis: 

An  Overview  of  Options  for  the  Novice  User." 
Ari  Siiva  and  Arthur  Conning,  UN  Latin  American 

Demographic  Centre  (Santiago,  Chile),  "Small-area 

Census  Data  Service  by  Microcomputer:  Application  of 

the  REDATAM  System  in  Latin  America  and  the  Caribbean." 
Charles  Post,  Mathematica  Policy  Research, 

"The  SIPP  Electronic  Codebook." 
S.    W.    Weisman,  National  Technical  Information  Service, 

"Data  Dissemination  on  Diskettes." 
Session  3-B:  Democratization  of  Data 

John  Gales,  John  Gales  Associates 

Betty  Steiger,  Information  Resources  Management  Associates 
Jack  Taub.  National  Information  Utilities 
Session  3-C:  Consumer  Expenditure  Surve\ 

Eva  Jacobs,  Bureau  of  Labor  Statistics 
Stephanie  Shipp,  Bureau  of  Labor  Statistics 
Session  4- A:  Storage  Technology  for  Data  Archives 

Margaret  O.    Adams,  National  Archives  and  Records  Administration,  Chair 
S.    Nazim  Ali,  Bahrain  University,  "Use  of  CD-ROM  in  a 

University  Library." 
Laura  Guy  and  Pat  Wenzel,  University  of  Wisconsin, 

"Data  Storage  on  CD  WORM." 
S.    Pal  Asija,  Attorney  and  Engineer,  Shelton,  Connecticut, 

"Handwritten  Data  Input,  Storage  Output,  and  Recognition  System." 
Session  4-B:  Issues  in  the  Integration  of  Machine-Readable 
Records  into  Traditional  Library  Service 
Jaia  Barrett,  Association  of  Research  Libraries,  chair 
Kathleen  Heim,  Louisiana  State  University 
Karin  Wittenborg,  University  of  California  at  Los  Angeles 
Donald  Treiman,  University  of  California  at  Los  Angeles 
Raymond  Carpenter,  University  of  Nort  Carolina  at  Chapel  Hill 
Session  4-C:  Statistjcal  Use  of  Administrative  Records 

Robert  Dalrymple,  U.S.    Food  and  Nutrition  Service 
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Carole  Trippe,  Mathemalica  Policy  Research,  Inc. 

Michael  Slrudler  and  Peter  Sailor,  Internal  Revenue  Service 
RECEPTION  ~  NATIONAL  ARCHIVES,  5:30  p.m.  -  7:30  p.m. 
FRIDAY,  MAY  27 

8:00  a.m.  to  5:00  p.m. 
REGISTRATION 
Lobby,  DuPont  Plaza  Hotel 
Morning  sessions,  9:00  a.m.  to  12:00  noon 
Plenary  session 

Session  5:  Professionalism  and  Professional  Standards  in  the 
Information  Age 
Constance  Citro,  Committee  on  National  Statistics, 

National  Research  Council,  Chair  and  Moderator 
Barbara  Bailer,  President,  American  Statistical  Association 
Frank  B.    Evans,  President-elect,  Society  of  American  Archivists 
Concurrent  Sessions 

Session  6- A:  The  Development  and  Management  of  Data  Archives 
Daniel  C.    Tsang,  University  of  California  —  Irvine, 

"Collection  Development  Policies  for  Machine-Readable  Data  Files." 
Rudolph  Bell,  Rutgers  University.  "Medieval  and  Eariy 

Modem  Data  Bank." 
Wong  Poh  Kam,  Seres  Sdn.    Bhd.    "Collection, 

Dissemination,  and  Preservation  of  New  Science  Data  in  Malaysia." 
Martin  Pawlocki  and  Elizabeth  Stephenson,  University  of  California 
at  Los  Angeles,  "A  Microcomputer  Based  Tape  Information  System." 
Session  6-B:  Data  Management  Technology 

Derek  Ballantyne,  National  Archives  of  Canada, 

"Using  a  Local  Area  Network  as  a  Front  End  for  Data 
Processing  in  an  Archives." 
Donald  P.    Trees,  Viar  &  Company, 

"A  Generalized  Approach  to  Data  Quality  Assurance  and  Control 
of  Large  Complex  Data  Structures  Using  SAS." 
David  L.    Sallach,  University  of  Arkansas, 

"Knowledge  Interchange  Standards  and  th4e  RM/T  Model." 
Session  6-C:  Cross-National  Social  Surveys 

Roger  Jones,  Australian  National  University,  Moderator/Convenor 
Speakers  will  discuss  content  and  archival  administration  of  the 

European  Values  Survey,  the  International  Social  Survey  Project,  the  Class  Suucture  and 
Class  Consciousness  survey,  and  other  similar  surveys. 
ROUNDTABLE  LUNCHES:  12:00  noon  -  1:30  p.m. 

Complex  Data  Sets  —  How  to  Make  Them  Friendly  and  usable 

—  Convenor:  Michal  Peleg 

Archival  Appraisal  of  Machine-Readable  Statistical  Files 

—  Convenor:  Michael  L.    Miller,  National  Archives 
and  Records  Administration 

Archival  Tools  —  Convenors:  Jacqueline  McGee,  The  Rand  Corporation 
Complex  Data  Sets  —  Convenors:  Pat  Doyle,  Mathematica  Policy  Research 
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and  David  McMillan,  Bureau  of  the  Census 
Afternoon  sessions  1:30  p.m.  -  5:00  p.m. 
Session  7- A,  8- A:  Techniques  and  Tools  of  Microsimulation 
William  Bradley,  Health  and  Welfare  Canada,  Chair 
Steven  Caldwell,  Cornell  University 
Steve  Gribble,  Statistics  Canada 
Sharon  Hirabayshi,  Mathematica  Policy  Research,  Inc. 
Richard  J.    Morrison,  Health  and  Welfare  Canada 
Randall  Webb,  Urban  Institute 
Bruce  Winer,  Carlton  University 
Michael  Wolfson,  Statistics  Canada 
Session  7-B:  On-line  Catalogues  With  Records  of  Computer  Data  Files 
Lynn  Marko,  University  of  Michigan, 

"Generalizing  Access  to  Statistical  Research  Materials." 
Ted  Brandhorst,  Director,  ERIC  Facility, 

"Access  to  Description  of  Education-Related  Machine-Readable 

Data  Files  Via  the  Bibliographic  Data  Base  of  the  ERIC  System." 
Ed  Hanis,  Tycho  Research,  Associates, 

"CULDAT:  The  Canadian  Union  List  of  Machine  Readable  Data  Files" 
Session  7-C:  Editing  Strategies 

Kay  Worrell,  Conference  Board,  Chair 

Speakers  will  discuss  alternative  strategies  for  editing  survey  and  administrative  data.    Of 

interest  is  the  effect  of  program  or  policy  objectives  on  the  choices  for  editing 

incomplete  and  inconsistent  data. 
Session  8-B:  Issues  in  Data  Dissemination 

Katharine  Sue  Gavrel,  National  Archives  of  Canada,  Chair 
E.    Walter  Terry,  Florida  State  University,  "Adopting  a 

Standard  for  Compression  of  Public  Use  Demographic  Data." 
Edie  Hedlin,  National  Archives  and  Records 

Administration,  "New  Regulations  on  Access  to  Highly 

Personal  Information  for  Statistical  Research." 
Session  8-C:  Mapping  as  a  Data  Dissemination  Technique 

Michal  Peleg,  Social  Science  Data  Archive,  Hebrew  University  of  Jerusalem, 

"Presenting  Spatial  Data  -  The  Statistical  Map  as  a  New  Practice." 
Alan  Fox,  Chadwyck-Healey,  Inc., 

"Supermap:  Revolutionizing  the  Way  to  Access  and  Manipulate  Demographic  Data." 
SATURDAY,  MAY  28 

8:30  a.m.  to  11:30  p.m. 
REGISTRATION 
Lobby,  DuPoni  Plaza  Hotel 
Plenan  session,  9:00  -  11:30 
Session  9:  Beyond  the  Social  Science  Data  Archives 
Hans  Jorgen  Marker,  Danish  Data  Archives, 

"History  and  the  Data  Archives." 
Jake  Th.    V.    Knoppers,  Canadian  Workplace  Automauon  Research  Center, 

"The  Archival  Implications  of  Electronic  Document/Data  Interchange." 
David  Bearman,  Archives  and  Museum  Informatics, 
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"Collecting  Software:  A  New  Challenge  for  Archives  and  Museums." 
Erwin  Scheuch,  Universit\-  of  Cologne  Institute  for  Applied  Social  Research, 

"Data  as  Part  of  a  Social  Science  Infrastructure." 
Felix  Krayeski,  Congressional  Research  Service, 

"Storage  and  Dissemination  Technology  for  Data  Archives." 
Lunch  and  Business  Meeting,  11:30  a.m.  -  1:30  p.m. 
1:30  p.m.  -  5:00  p.m.    WORKSHOPS 

(Requires  separate  registration) 
Session  22:  Interchange  Standards  (tentative) 

Organizer:  Ted  Weir,  National  Archives  and  Records  Administration 
Session  23:  Data  Products  From  the  Bureau  of  the  Census: 

1990  Decennial  Data,  Economic  Census  Data,  Foreign  Trade  Data 
Organizer:  Barbara  Aldrich,  Bureau  of  the  Census 
Session  24:  The  Data  Archive  on  Adolescent  Pregnancy  and 
Pregnancy  Prevention 
Organizer:  J.    J.    Card,  Sociometrics  Corporation 
SUNDAY,  MAY  29 

9:30  a.m.  -  12:30  p.m.    WORKSHOPS 
(Requires  separate  registration) 
Session  25:  Integrating  Machine-Readable  Records  into 
Traditional  Library  Senices 
Organizer:  Elizabeth  Stephenson,  University  of 
California  at  Los  Angeles 
Session  26:  Special  Aspects  of  the  National  Logitudinal  Surveys  of  Youth: 
Focus  on  the  NLSY  Child-Mother  Data 
Organizer:  Paula  C.    Baker,  Center  for  Human  Resource  Research, 
The  Ohio  State  Universir.- 
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lASSIST 


Membership 
form 


The  International  Association  for 
Social  Science  Information  Services 
and  Technology  (lASSIST)  is  an 
international  association  of 
individuals  who  are  engaged  in  the 
acquistion,  processing,  maintenance, 
and  distribution  of  machine  readable 
text  and  /or  numeric  social  science 
data.  The  membership  includes 
information  system  specialists,  data 
base  librarians  or  administrators, 
archivists,  researchers,  programmers, 
and  managers.  Their  range  of  interests 
cncompases  hard  copy  as  well  as 
machine  readable  data. 

Paid-up  members  enjoy  voting  rights 
and  receive  the  lASSIST 
QUARTERLY.  They  also  benefit 


from  reduced  fees  for  attendance  at 
regional  and  international 
conferences  sponsored  by 
lASSIST. 

Membership  fees  are: 

Regular  Membership.  $20.00  per 

calendar  year. 

Student  Membership:  $10.00  per 

calendar  year. 

Institutional  subcriptions  to  the 
quarterly  are  available,  but  do  not 
confer  voting  rights  or  other 
membership  benefits. 

Institutional  Subcription:  $35.00 
per  calendar  year  (includes  one 
volume  of  the  Quarterly) 


!    I  would  like  to  become  a  member 
of  Iassist.  Please  see  my  choice 
below: 

$20  Regular  Membership 
$10  Student  Membership 
$35  Institutional 
Membership 
My  primary  Interests  are: 
Archive  Services/ 
Adminisu-ation 
Data  Processing/Data 
Management 
Research  Applications 
Other  (specify) 


1 


Please  make  checks  payable 
to  iassist  and  Mail  to: 

Ms  Jackie  McGee 
Treasurer,  lASSIST 
%  Rand  Corporation 
1700  Main  Street 
Santa  Monica 
California  USA  90406 


Name/phone 


Institutional  Affiliation 
Mailing  Address 


City 


I   Country/zip/postalcode 
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Editorial    Information 


The  lASSIST  QUARTERLY  represents  an  international  cooperative  effort  on  the  part  of  individuals 
managing,  operating,  or  using  machine-readable  data  archives,  data  libraries,  and  data  services.  The 
QUARTERLY  reports  on  activities  related  to  the  production,  acquisition,  preservation,  processing, 
distribution,  and  use  of  machine-readable  data  carried  out  by  its  members  and  others  in  the 
international  social  science  community.  Your  contributions  and  suggestions  for  topics  of  interest  are 
welcomed.  The  views  set  forth  by  authors  of  articles  contained  in  this  publication  are  not  necessarily 
those  of  lASSIST. 

Information  for  Authors 

The  QUARTERLY  is  published  four  times  per  year.  Articles  and  other  information  should  be 
typewritten  and  double-spaced.  Each  page  of  the  manuscript  should  be  numbered.  The  first  page 
should  contain  the  article  title,  author's  name,  affiliation,  address  to  which  correspondence  may  be 
sent,  and  telephone  number.  Footnotes  and  bibliographic  citations  should  be  consistent  in  style, 
preferably  following  a  standard  authority  such  as  the  University  of  Chicago  press  Manual  of  Style  or 
Kate  L.  Turabian's  Manual  for  Writers.  Where  appropriate,  machine-readable  data  files  should  be 
cited  with  bibliographic  citations  consistent  in  style  with  Dodd,  Sue  A.  "Bibliographic  references  for 
numeric  social  science  data  files:  suggested  guidelines".  Journal  of  the  American  Society  for 
Information  Science  30(2):77-82,  l^archi  1979.  If  the  contribution  is  an  announcement  of  a 
conference,  training  session,  or  the  like,  the  text  should  include  a  mailing  address  and  a  telephone 
number  for  the  director  of  the  event  or  for  the  organization  sponsoring  the  event.  Book  notices  and 
reviews  should  not  exceed  two  double-spaced  pages.  Deadlines  for  submitting  articles  are  six  weeks 
before  publication,  ivlanuschpts  should  be  sent  in  duplicate  to  the  Editor: 

Walter  Piovesan,  Research  Data  Library,  W.A.C.  Bennett  Library,  Simon  Eraser  University 
Burnaby,  B.C.,  V5A  1S6  CANADA  (01)604/291-4349  E-fvlail:  USERDLIB@SEU.BITNET 

Book  reviews  should  be  submitted  in  duplicate  to  the  Book  Review  Editor: 

Daniel  Tsang,  Main  Library,  University  of  California  P.O.  Box  19557,  Irvine,  California 
92713  USA.  (01)714/856-4978  E-IVlail:  DTSANG@ORION.CF.UCI.EDU 
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